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Foreword to the 
Second Edition 



Much has changed in the world of fractals, computer graphics and modern mathemat- 
ics since the first edition of Fractals Everywhere appeared. The company Iterated 
Systems, Inc., founded by Michael Barnsley and Alan Sloan, is now competing in the 
image compression field with both hardware and software products that use fractal 
geometry to compress images. Indeed, there is now a plethora of texts on subjects like 
fractals and chaos, and these terms are rapidly becoming "household words." 

The fundamental approach to fractal geometry through iterated function systems 
remains sound as an introduction to the subject. This edition of Fractals Everywhere 
leaves this approach largely as it stands. One still needs a grounding in concepts in 
metric space theory and eventually (see Chapter IX) measure theory to get a working 
understanding of the subject. However, there have been several additions to help ease 
and broaden the reader's development. 

Primary to these is the addition of answers to the mathematical problems. These 
were done largely by starting at one end of the book writing the answers until the other 
cover was reached. Most of the answers found in the key have been worked over at 
least twice, in hopes improving the accuracy of the key. Every effort has been make 
to rely solely on the material presented ahead of each problem, although in a few of 
the harder problems some concepts have been introduced in the answers themselves. 
These are not considered necessary to the development of the main thread of the text; 
however, if the reader finds some areas of mathematics touched on in looking at the 
presented solutions which extend the feeling for the subject, the key has served its 
purpose. 

In addition the the answer key, there have been some other changes as well. In 
Chapter III, section 1 1, the main theorem has been qualified. The reader with more 
mathematical background will recognize that the additional Lipshitz condition 
satisfies the need for equicontinuity in Theorem 11.1. This is not the only way to 
satisfy it, just the clearest in terms of the presumed mathematical background. 
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There have been problems added to several chapters to develop the idea of 
Cartesian products of code spaces. This was done because it helps bridge the gap 
between IFS theory and the reversible systems found in physical chaos, and because 
it presents an interesting way of looking the the Random Iteration Algorithm in 
Chapter IX. The thread of these problems begins in Chapter II, leads up to the baker's 
transformation in Chapter IV, and is completed as an example in Chapter IX. 
Additional problems were added in Chapter III to develop some basic properties of 
eigenvalues and eigenvectors, which can be useful in examining dynamics both from 
the point of view described in the text, and elsewhere. It is hoped that with these 
additional tools those readers whose goals are application-oriented will come away 
with more at their disposal, while the text itself will retain its readable style. 

I would like to thank Lyman Hurd for many useful discussions about the 
topological nature of nonempty compact sets, and John Elton for his patience while 
I ran many of my new examples and problems past him to check them and to check 
the "excitement level" of the additional material. 

Hawley Rising 

It seems now that deterministic fractal geometry is racing ahead into the serious 
engineering phase. Commercial applications have emerged in the areas of image 
compression, video compression, computer graphics, and education. This is good 
because it authenticates once again the importance of the work of mathematicians. 
However, sometimes mathematicians lose interest in wonderful areas once scientists 
and engineers seem to have the subject under control. But there is so much more 
mathematics to be done. What is a useful metric for studying the contractivity of the 
vector recurrent IFS of affine maps in IR 2 ? What is the information content of a 
picture? Measures, pictures, dreams, chaos, flowers and information theory — the 
hours of the days keep rushing by: do not let the beauty of all these things pass us by too. 



Michael Fielding Barnsley 
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Chapter I 

Introduction 



Fractal geometry will make you see everything differently. There is danger in read- 
ing further. You risk the loss of your childhood vision of clouds, forests, galaxies, 
leaves, feathers, flowers, rocks, mountains, torrents of water, carpets, bricks, and 
much else besides. Never again will your interpretation of these things be quite the 
same. 

The observation by Mandelbrot [Mandelbrot 1982] of the existence of a "Geom- 
etry of Nature" has led us to think in a new scientific way about the edges of clouds, 
the profiles of the tops of forests on the horizon, and the intricate moving arrange- 
ment of the feathers on the wings of a bird as it flies. Geometry is concerned with 
making our spatial intuitions objective. Classical geometry provides a first approx- 
imation to the structure of physical objects; it is the language that we use to com- 
municate the designs of technological products and, very approximately, the forms 
of natural creations. Fractal geometry is an extension of classical geometry. It can 
be used to make precise models of physical structures from ferns to galaxies. Fractal 
geometry is a new language. Once you can speak it, you can describe the shape of a 
cloud as precisely as an architect can describe a house. 

This book is based on a course called "Fractal Geometry," which has been taught 
in the School of Mathematics at the Georgia Institute of Technology for two years. 
The course is open to all students who have completed two years of calculus. It 
attracts both undergraduate and graduate students from many disciplines, includ- 
ing mathematics, biology, chemistry, physics, psychology, mechanical engineering, 
electrical engineering, aerospace engineering, computer science, and geophysical 
science. The delight of the students with the course is reflected in the fact there is 
now a second course, entitled "Fractal Measure Theory." The courses provide a com- 
pelling vehicle for teaching beautiful mathematics to a wide range of students. 

Here is how the course in Fractal Geometry is taught. The core is Chapter II, 
Chapter III, sections ^1-5 of Chapter IV, and sections 1-3 of Chapter V. This is 
followed by a collection of delightful special topics, chosen from Chapters VI, VII, 
and VIII. The course is taught in 30 one-hour lectures. 
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Chapter II introduces the basic topological ideas that are needed to describe sub- 
sets to spaces such as IR 2 . The framework is that of metric spaces; this is adopted 
because metric spaces are both rigorously and intuitively accessible, yet full of 
suprises. They provide a suitable setting for fractal geometry. The concepts intro- 
duced include openness, closedness, compactness, convergence, completeness, con- 
nectedness, and equivalence of metric spaces. An important theme concerns proper- 
ties that are preserved under equivalent metrics. Chapter II concludes by presenting 
the most exciting idea: a metric space, denoted H, whose elements are the nonempty 
compact subsets of a metric space. Under the right conditions this space is complete, 
sequences converge, and fractals can be found! 

Chapter III deals with transformations on metric spaces. First, the goal is to de- 
velop intuition and practical experience with the actions of elementary transforma- 
tions on subsets of spaces. Particular attention is devoted to affine transformations 
and Mobius transformations in IR 2 . Then the contraction mapping principle is re- 
vealed, followed by the construction of contraction mappings on 7i. Fractals are 
discovered as the fixed points of certain set maps. We learn how fractals are gener- 
ated by the application of "simple" transformations on "simple" spaces, and yet they 
are geometrically complicated. We explain what an iterated function system (IFS) 
is, and how it can define a fractal. Iterated function systems provide a convenient 
framework for the description, classification, and communication of fractals. Two 
algorithms, the "Chaos Game" and the Deterministic Algorithm, for computing pic- 
tures of fractals are presented. Attention is then turned to the inverse problem: given 
a compact subset of IR 2 , fractal, how do you go about finding a fractal approximation 
to it? Part of the answer is provided by the Collage Theorem. Finally, the thought of 
the wind blowing through a fractal tree leads to discovery of conditions under which 
fractals depend continuously on the parameters that define them. 

Chapter IV is devoted to dynamics on fractals. The idea of addresses of points on 
certain fractals is developed. In particular, the reader learns about the metric space to 
which addresses belong. Nearby addresses correspond to nearby points on the frac- 
tal. This observation is made precise by the construction of a continuous transforma- 
tion from the space of addresses to the fractal. Then dynamical systems on metric 
spaces are introduced. The ideas of orbits, repulsive cycles, and equivalent dynam- 
ical systems are described. The concept of the shift dynamical system associated 
with an IFS is introduced and explored. This is a visual and simple idea in which the 
author and the reader are led to wonder about the complexity and beauty of the avail- 
able orbits. The equivalence of this dynamical system with a corresponding system 
on the space of addresses is established. This equivalence takes no account of the ge- 
ometrical complexity of the dance of the orbit on the fractal. The chapter then moves 
towards its conclusion, the definition of a chaotic dynamical system and the realiza- 
tion that "most" orbits of the shift dynamical system on a fractal are chaotic. To this 
end, two simple and delightful ideas are shown to the reader. The Shadow Theorem 



illustrates how apparently random orbits may actually be the "shadows" of determin- 
istic motions in higher-dimensional spaces. The Shadowmg Theorem demonstrates 
how a rottenly inaccurate orbit may be trailed by a precise orbit, which clings like a 
secret agent. These ideas are used to make an explanation of why the "Chaos Game" 
computes fractals. 

Chapter V introduces the concept of fractal dimension. The fractal dimension of 
a set is a number that tells how densely the set occupies the metric space in which it 
lies. It is invariant under various stretchings and squeezings of the underlying space. 
This makes the fractal dimension meaningful as an experimental observable; it pos- 
sesses a certain robustness and is independent of the measurement units. Various 
theoretical properties of the fractal dimension, including some explicit formulas, 
are developed. Then the reader is shown how to calculate the fractal dimension of 
real-world data, and an application to a turbulent jet exhaust is described. Lastly the 
Hausdorff-Besicovitch dimension is introduced. This is another number that can be 
associated with a set. It is more robust and less practical than the fractal dimension. 
Some mathematicians love it; most experimentalists hate it; and we are intrigued. 

Chapter VI is devoted to fractal interpolation. The aim of the chapter is to teach 
the student practical skill in using a new technology for making complicated curves 
and fitting experimental data. It is shown how geometrically complex graphs of con- 
tinuous functions can be constructed to pass through specified data points. The func- 
tions are represented by succinct formulas. The main existence theorems and com- 
putational algorithms are provided. The functions are known as fractal interpolation 
functions. It is explained how they can be readily computed, stored, manipulated 
and communicated. "Hidden variable" fractal interpolation functions are introduced 
and illustrated; they are defined by the shadows of the graphs of three-dimensional 
fractal paths. These geometrical ideas are extended to introduce space-filling curves. 

Chapter VII gives an introduction to Julia sets, which are deterministic fractals 
that arise from the iteration of analytic functions. The objective is to show the reader 
how to understand these fractals, using the ideas of Chapters III and IV. In so do- 
ing we have the pleasure of explaining and illustrating the Escape Time Algorithm. 
This algorithm is a means for computergraphical experimentation on dynamical sys- 
tems that act on two-dimensional spaces. It provides illumination and coloration, a 
seachlight to probe dynamical systems for fractal structures and regions of chaos. 
The algorithm relies on the existence of "repelling sets" for continuous transforma- 
tions which map open sets to open sets. The applications of Julia sets to biological 
modelling and to understanding Newton's method are considered. 

Chapter VIII is concerned with how to make maps of certain spaces, known 
as parameter spaces, where every point in the space corresponds to a fractal. The 
fractals depend "smoothly" on the location in the parameter space. How can one 
make a picture that provides useful information about what kinds of fractals are 
located where? If both* the space in which the fractals lie and the parameter space 
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are two-dimensional, the parameter space can sometimes be "painted" to reveal an 
associated Mandelbrot set. Mandelbrot sets are defined, and three different examples 
are explored, including the one discovered by Mandelbrot. A computergraphical 
technique for producing images of these sets is described. Some basic theorems are 
proved. 

Chapter IX is an introduction to measures on fractals and to measures in general. 
The chapter is an outline that can be used by a professor as the basis of a course in 
fractal measure theory. It can also be used in a standard measure theory course as 
a source of applications and examples. One goal is to demonstrate that measure the- 
ory is a workaday tool in science and engineering. Models for real-world images can 
be made using measures. The variations in color and brightness, and the complex 
textures in a color picture, can be successfully modelled by measures that can be 
written down explicitly in terms of succinct "formulas." These measures are desir- 
able for image engineering applications, and have a number of advantages over non- 
negative "density" functions. Section 1 provides an intuitive description of measures 
and motivates the rest of the chapter. The context is that of Borel measures on com- 
pact metric spaces. Fields, sigma-fields, and measures are defined. Caratheodory's 
extension theorem is introduced and used to explain what a Borel measure is. Then 
the integral of a continuous real-valued function, with respect to a measure, is de- 
fined. The reader learns to evaluate some integrals. Next the space V of normalized 
Borel measures on a compact metric space is defined. With an appropriate metric, 
V becomes a compact metric space. Succintly defined contraction mappings on this 
space lead to measures that live on fractals. Integrals with respect to these measures 
can be evaluated with the aid of Elton's ergodic theorem. The book ends with a de- 
scription of the application of these measures to computer graphics. 

This book teaches the tools, methods, and theory of deterministic geometry. It 
is useful for describing specific objects and structures. Models are represented by 
succinct "formulas." Once the formula is known the model can be reproduced. We 
do not consider statistical geometry. The latter aims at discovering general statistical 
laws that govern families of similar-looking structures, such as all cumulus clouds, 
all maple leaves, or all mountains. 

In deterministic geometry, structures are defined, communicated, and analyzed, 
with the aid of elementary transformations such as affine transformations, scalings, 
rotations, and congruences. A fractal set generally contains infinitely many points 
whose organization is so complicated that it is not possible to describe the set by 
specifying directly where each point in it lies. Instead, the set may be defined by 
"the relations between the pieces." It is rather like describing the solar system by 
quoting the law of gravitation and stating the initial conditions. Everything follows 
from that. It appears always to be better to describe in terms of relationships. 



Chapter II 



Metric Spaces; 
Equivalent Spaces; 
Classification of Subsets; 
and the Space of Fractals 



1 Spaces 

In fractal geometry we are concerned with the structure of subsets of various very 
simple "geometrical" spaces. Such a space is denoted by X. It is the space on which 
we think of drawing our fractals; it is the place where fractals live. What is a fractal? 
For us, for now, it is just a subset of a space. Whereas the space is simple, the fractal 
subset may be geometrically complicated. 

Definition 1.1 A space X is a set. The points of the space are the elements of 
the set. 

Although this definition does not say it, the nomenclature "space" implies that 
there is some structure to the set, some sense of which points are close to which. We 
give some examples to show the sort of thing this may mean. Throughout this text D& 
denotes the set of real numbers, and "g" means "belongs to." 

Examples 

1.1. X = DL Each "point" jc e X is a real number, or a dot on a line. 

1 .2. X = C[0, 1], the set of continuous functions that take the real closed interval 
[0, 1] = {jc G D& : 0 < jc < 1} into the real line R. A "point" / e X is a function 
/ : [0, 1] ^> D&. / may be represented by its graph. 

5 
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Figure II. 1. A point x 
in K. 




Figure 11.2. A point / 
in the space of continuous 
functions on [0, 1]. 




Notice that here / e X is not a point on the jc-axis; it is the whole function. 
A continuous function on an interval is characterized by the fact that its graph is 
unbroken; as a picture it contains no rips or tears; it can be drawn without removing 
the pencil from the paper. 

1 .3. X = R 2 , the Euclidean plane, the coordinate plane of calculus. Any pair of real 
numbers ;q, x 2 e R determines a single point in R 2 . A point jc e X is represented in 
several equivalent ways: 



x = (Xi, X2) 



(«) 



= a point in a figure such as Figure II.3. 



The spaces in examples 1.1, 1.2, and 1.3 are each linear spaces: there is an 
obviously defined way, in each case, of adding two points in the space to obtain a 
new one in the same space. In 1.1 if jc and ye I, then jc + y is also in R; in 1 .2 we 
define (/ + g)(x) = f(x) + g(x); and in 1.3 we define 



x + y = 




Similarly, in each of the above examples, we can multiply members of X by a scalar, 
that is, by a real number ael. For example, in 1.2 (af)(x) = af(x) for any a e R, 
and a/ e C[0, 1] whenever / e C[0, 1]. Example 1.1 is a one-dimensional linear 
space; 1.2 is an oo-dimensional linear space (can you think why the dimension is 
infinite?); and 1.3 is a two-dimensional linear space. A linear space is also called a 
vector space. The scalars may be complex numbers instead of real numbers. 

1 .4. The complex plane, X = C, where any point xeXis represented 



for some pair of real numbers jci, x 2 € R. Any pair of numbers x { ,x 2 eR determines 
a point of C It is obvious that C is essentially the same as U 2 , but there is an implied 
distinction. In C we can multiply two points x, y and obtain a new point in C 
Specifically, we define 



C together with the "point at infinity." Here is a way of constructing and thinking 
about C Place a sphere on the plane C, with the South Pole on the origin, and the 
North Pole N vertically above it. 

To a given point x e C we associate a point x' on the sphere by constructing 
the straight line from N to x and marking where this line intersects the sphere. 
This associates a unique point x' = h(x) with each point x e C The transformation 
h : C -> sphere is clearly continuous in the sense that nearby points go to nearby 
points. Points farther and farther away from 0 in the plane C end up closer and 
closer to N. C consists of the completion of the range of h by including N on the 
sphere: The "point at infinity (oo)" can be thought of as a giant circle, infinitely far 
out in C, whose image under h is N. It is easier to think of C being the whole of the 
sphere, rather than as the plane together with oo. It is of interest that h : C -> sphere 



x = X\ + iX2, 




x • y - (xi + ix 2 )(y\ + iyi) = (x { y { - x 2 yi) + i(x 2 yi + x { y 2 ) 



1 .5. X = C, the Riemann sphere. Formally, C = CU {oo}; that is, all the points of 



R 2 




Figure 11.3. A point x 
in the space R 2 . 
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Figure 11.4. Construc- 
tion of a geometrical rep- 
resentation for the Rie- 
mann sphere. N is the 
North Pole and corre- 
sponds to the "point at 
infinity." 




is conformal: it preserves angles. The image under h of a triangle in the plane is a 
curvaceous triangle on the sphere. 

Although the sides of the triangle on the sphere are curvaceous they meet in 
well-defined angles, as one can visualize by imagining the globe to be magnified 
enormously. The angles of the curvaceous triangle are the same as the corresponding 
angles of the triangle in the plane. 

Examples & Exercises 

1.6. X = S, the code space on AT symbols. Usually the symbols are the integers 
{0, 1, 2, . . . , N - 1}. A typical point in X is a semi-infinite word such as 

x = 2 17 0 0 1 21 15 (Af — 1) 3 0 

There are infinitely many symbols in this sequence. In general, for a given element 
x e X, we can write 



Figure 11.5. A triangle 
in the plane corresponds 
to a curvaceous triangle 
on the sphere. 



N 



x = jcijc 2 jc 3 jc4*5*6*7*8 • • • > where each e {0, 1, 2, . . . , Af - 1}. 

There are many names attached to this space because of its importance in a variety 
of branches of mathematics and physics. When each symbol is intended to represent 
a random choice from N possibilities, each point in this space represents a particular 
sequence of events, from a set of N possible events. In this case, the space is some- 
times called the space of Bernoulli trials. When there are several code spaces being 
referred to, it is customary to write the code space on N symbols as Y, N . 

1 .7. A few other favorite spaces are defined as follows. 

(a) A disk in the plane with center at the origin and with finite radius R > 0: 

f ={xe R 2 :x 2 + x 2 < R 2 }. 

(b) A "filled" square: 

■ = {x e R 2 : 0 < jci < 1, 0 < x 2 < 1}- 

(c) An interval: 

[a, b] = {x e R : a < x < b}, where a and b are real numbers with a < b. 

(d) Body space: 

= {x e R 3 : coordinate points implied by a cadaver frozen in R 3 }. 

(e) Sierpinski space 

A = {x e R 2 : x is a point on a certain fixed Sierpinski triangle}. 

Sierpinski triangles occur often in this text. See, for example, Figure IV.94. 

1.8. Show that the examples in 1.5, 1.6, and 1.7 are not vector spaces, at least if 
addition and multiplication by reals are defined in the usual way. 

1 .9. The notation A C X means A is a subset of X; that is, if x e A then x e X, or 
x e A implies x eX. The symbol 0 means the empty set. It is defined to be the set 
such that the statement "x e 0" is always false. We use the notation {x} to denote the 
set consisting of a single point x e X. Show that if x e X, then {x} is a subset of X. 

1.10. Any set of points makes a space, if we care to define it as such. The points are 
what we choose them to be. Why, do you think, have the spaces defined above been 
picked out as important? Describe other spaces that are equally important. 

1.11. Let Xj and X 2 be spaces. These can be used to make a new space denoted 
Xj x X 2 , called the Cartesian product of Xj and X 2 . A point in Xj x X 2 is repre- 
sented by the ordered pair (x u x 2 ), where x\ e Xj and x 2 e X 2 . For example, R 2 is 
the Cartesian product of R and R. 

1.12. As another example of a Cartesian product let 

X = {(x,y):x,y E E} = E x S, 




10 Chapter II Metric Spaces; Equivalent Spaces 



where E is the code space on N symbols. This has an interpretation in terms of 
the random choices mentioned in exercise 1.6. We call y the past and x the future. 
Then each element of the space represents a sequence of "coin tosses" (the coins are 
really more like N -sided dice); y represents the tosses that have already happened, 
beginning with the latest one, and x represents the tosses to come (beginning with 
the next one). If we rewrite the point (x,y) with a dot marking the "present," 

• • • ymy\ • *i*2*3 • • • 

then the act of moving the dot to the right moves one future coin toss to a past coin 
toss; the obvious interpretation is that it represents flipping the coin. Moving the dot 
is called a shift, and the space is called the space of shifts on N symbols. It is also 
denoted S, whether it is this space or code space that is being referred to is usually 
clear from context. In this book E will always be code space unless specifically 
mentioned. 



2 Metric Spaces 

We use the notation "V" to mean "for all." We also introduce the notation A \ B to 
mean the set A "take away" the set B. That is, A \ B = {x e A : x # B}. We use "=>•" 
to mean "implies." 

Definition 2. 1 A metric space (X, d) is a space X together with a real-valued 
function d : X x X IR, which measures the distance between pairs of points x and 
y in X. We require that d obeys the following axioms: 

(1) d(x,y)=d(y,x)Vx,yeX 

(2) 0 < d(x, y) < oo Vjc, y e X, x # y 

(3) d(x,x)=0 Vjc eX 

(4) d(x, y) < d(x, z) + d(z, y) Vjc, y, z e X. 

Such a function d is called a metric. 

The concept of shortest paths between points in a space, geodesies, is depen- 
dent on the metric. The metric may determine a geodesic structure of the space. 
Geodesies on a sphere are great circles; in the plane with the Euclidean metric they 
are straight lines. 

Examples & Exercises 

2. 1 . Show that the following are all metrics in the space X = IR: 

(a) d(x, y) = \x — y \ (Euclidean metric) 

(b) d(x,y)=2-\x-y\ 

(c) d(x,y) = \x 3 -y 3 \ 
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X 



Figure 11.6. (The angle 
9, and the distances r\, r 2 
used to construct a metric 
on the punctured plane.) 
Acute angle subtended by 
two straight lines. 



0 



2.2. Show that the following are metrics in the space X = R 2 : 

(a) d(x,y) = a/Ui - Ji) 2 + (*2 - J2) 2 (Euclidean metric) 

(b) d(x, y) =\x\-y\\ + \x 2 - yi\ (Manhattan metric) 

Why is the name "Manhattan" used in connection with (b)? 

2.3. Show that d(x, y) = \xy \ does not define a metric in IR. 

2.4. Let R 2 \ {0} denote the punctured plane. Define d(x, y) as follows: 

d(x,y) = \r 1 -r 2 \ + \0\, 

where r x = Euclidean distance from x to O, r 2 = Euclidean distance from y to 
O, O is the origin, and 8 is the smallest angle subtended by the two straight lines 
connecting x and y to the origin. Show that d is a metric. 

2.5. On the code space E define 



Show that every pair of points in E is a finite distance apart. That is, d is indeed 
a function that takes E x E into R. Verify that (E, d) is a metric space. Try to 
envisage a possible geometry for E. (Do not confuse the possible meanings of the 
symbol E ; from its context it should be clear when it refers to code space and when 
it refers to summation.) 

*2.6. Define X = {(x, y) : x, y e E}, the space of shifts on N symbols as in exer- 
cise 1.12. We can define a Euclidean distance by treating each coordinate as a base 
N + 1 number between 0 and 1. That is, we make the distance between (x, y) and 
(u, v) equal to 
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Show that this is indeed a metric space. 

2.7. In X =1^ define d(x , y) to be the Euclidean length of the shortest path lying 

entirely within X which connects x and y. Show that this is a metric. Discuss the 
utility of this metric in anatomy. The distance from a toenail to a fingertip does not 
depend much on the configuration of the body, whereas the usual spatial distance 
would. 

2.8. Invent a function d : ■ x ■ — > R which is not a metric. Define a metric for 
the space namely an annulus, which makes it seem like the curved wall of a 

cylinder: 

* 2.9. Show that a metric on X = C is defined by shortest great circle distances on 
the sphere. Compare the distances from 0, and from 1 + / , to oo. 

Definition 2.2 Two metrics d\ and d2 on a space X are equivalent if there exist 
constants 0 < c\ < c 2 < oo such that 

c\d\(x, y) < d2(x, y) < c 2 d\(x, y), V(x, y) e X x X. 

Examples & Exercises 

2.10. Definition 2.2 looks unsymmetrical; it does not appear to make the same 
requirements on d\ as it does on d 2 . Show that this is an illusion by establishing 
that if the definition holds then there are constants 0 < e\ < e 2 < oo so that 

e\d 2 (x, y) <d\(x,y) < e 2 d 2 (x, y), V(x, y) e X x X. 

2.11. Are the Manhattan and Euclidean metrics equivalent on ■ c i& 2 ? What about 
on [R 2 ? 

2. 12. Show that the metric in exercise 2.4 is not equivalent to the Euclidean metric 
on ®\{0}. 

One notion underlying the concept of equivalent metrics is that any pair of equiv- 
alent metrics gives the same notion of which points are close together and which 
are far apart. It is as though there were a standard way for boundedly deforming the 
space, whereby distances are determined both before and after deformation. 

For example, consider a pair of points x and y in ■ c R 2 . Let the Euclidean dis- 
tance between these points be d\(x, y). Think of a thin rubber sheet lying over ■. 
This sheet is stretched in some repeatable fashion, carrying copies of the points x 
and y to new locations, as illustrated in Figure II.7. The Euclidean distance between 
these moved points is called d 2 (x, y). The condition of equivalence is the require- 
ment that there is no extreme (infinite) stretching or compression of the space. 

This leads us to the idea of equivalent metric spaces: 

Definition 2.3 Two metric spaces (X\,d\) and (X 2 , d 2 ) are equivalent if there 
is a function h: Xi — > X 2 that is one-to-one and onto (i.e., it is invertible), such that 
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Figure 11.7. A thin rub- 
ber sheet lies over the ■ in 
the plane and is stretched. 
The Euclidean distances 
between points are de- 
termined before and af- 
ter deformation, yielding 
two metrics. These met- 
rics may be equivalent if 
the deformation leads to 
no rips, tears, or infinite 
stretching. 



Point at the corner is missing 




The point which is not 
at the corner is pulled 
to infinity. 



Xj and X2 are homeomorphic : they have the same topology. 
But they are not equivalent : their geometries are deeply different. 



the metric ~d\ on\\ defined by 

d l (x,y)=d 2 (h(x),h(y)), 

is equivalent to d\ . 



Vx, yeXi 



One can think of Definition 2.3 as requiring that Xi and X 2 are related to one 
another by a bounded deformation, and nowhere is there an arbitrarily large com- 
pression or stretching; also, there is no overlapping, folding, or ripping. 



Figure 11.8. This pic- 
ture suggests two metric 
spaces Xj and X 2 that have 
the same topology, but that 
are not metrically equiv- 
alent: their "geometries" 
are deeply different. 
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Definition 2.4 A function f : Xi X 2 from a metric space (Xi, d\) into a 
metric space (X 2 , d 2 ) is continuous if for each € > 0 and x eX\, there is a 8 > 0 
so that 



If f is also one-to-one and onto, and thus invertible, and if also the inverse f of f 
is continuous, then we say that f is a homeomorphism between Xi and X 2 . In such 
a case we say that Xi and X 2 are homeomorphic . 

The assertion that two spaces are equivalent metrically is much stronger than the 
statement that they are homeomorphic: to be equivalent there must be a bounded re- 
lationship between e and 8 independent of x. Homeomorphism is the equivalence re- 
lationship for topological properties; two spaces that are homeomorphic are identical 
topological spaces. Two metrics d\ and d 2 on a given space X are identical topolog- 
ical^ (define the same topological space) if the identity map i : (X, d\) (X, d 2 ) 
given by i(x) = x is a homeomorphism. 

Examples & Exercises 

2.13. Let Xi = [1, 2] and X 2 = [0, 1]. Let d\ denote the Euclidean and let d 2 (x, y) 
= 2 ■ \x — y \ in X 2 . Show that (Xi, d\) and (X 2 , d 2 ) are equivalent metric spaces. 

2. 14. Show that (■, Euclidean) and (■, Manhattan) are equivalent metric spaces. 

2. 15. Show that (C, Euclidean) and (R 2 , Manhattan) are equivalent metric spaces. 

2.16. Define two different metrics on the space X = (0, 1] = (^gI:0<i<1) 



Show that (X, d\) and (X, d 2 ) are not equivalent metric spaces. 

2. 1 7. Figure II.9 suggests a subset (black) of (■, Euclidean). It also shows the space 
and set deformed by a metric equivalence. Discuss the properties of the image that 
would be invariant under (a) any metric equivalence, and (b) any homeomorphism. 
To what extent might one be able to "see" these invariances? Think about how much 
deformation an image can withstand while remaining recognizably the same image. 
Look at reflections of sets and images in the back of a shiny spoon. 

2. 1 8. Show that if two metric spaces are metrically equivalent then there is a home- 
omorphism between them. 

* 2. 1 9. We can define metrics on E, our code space, by 



d l (x,y)<8^d 2 (f(x),f(y))<€. 



by 



1 

d\(x, y) = \x - y\ and d 2 (x, y) = | 



y 



l 
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where k > 1. The most important choices for k are N + 1 and N, but other choices 
(mostly real numbers between these two) are useful. Show that the functions d k : 
E x £ 1R are metrics, and that these metrics are identical topologically. 



3 Cauchy Sequences, Limit Points, Closed Sets, Perfect Sets, and Complete 
Metric Spaces 

Fractal geometry is concerned with the description, classification, analysis and ob- 
servation of subsets of metric spaces (X, d). The metric spaces are usually, but not 
always, of an inherently "simple" geometrical character; the subsets are typically 
geometrically "complicated." There are a number of general properties of subsets of 
metric spaces, which occur over and over again, which are very basic, and which 
form part of the vocabulary for describing fractal sets and other subsets of metric 
spaces. Some of these properties, such as openness and closedness, which we are 
going to introduce, are of a topological character. That is to say, they are invariant 
under homeomorphism. 



Figure 11.9. What fea- 
tures of the set (black) are 
invariant under a metric 
equivalence transforma- 
tion? Two sets that are 
metrically equivalent to 
(a) are shown in (b) and 
(c). 
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Figure 11.9. (b) 



Figure 11.9. (c) 




Fo. us what is important, however, is that there is another class of properties that 
are invariant under metric space equivalence. These include openness, closedness, 
boundedness, completeness, compactness, and perfection; these properties are intro- 
duced in this and the next section. Later we will discover another such property, the 
fractal dimension of a set. If a subset of a metric space has one of these properties, 
and the space is deformed with bounded distortion, then the corresponding subsets 
in the deformed space still have that same property. 

We are also about another business in this section. In our search for fractals we 
are always going to look in a certain type of metric space known as "complete." We 
need to understand this concept. 

Definition 3.1 A sequence {x n }^Li of points in a metric space (X, d) is called a 
Cauchy sequence if, for any given number e > 0, there is an integer N > 0 so that 

d{x n ,x m )<e for all «, m > N. 

In other words, the further along the sequence one goes, the closer together be- 
come the points in the sequence. Mentally one pictures something like the image in 
Figure 11.10. 

However, just because a sequence of points moves closer together as one goes 
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Figure 11.10. Image 




representing succes- 
sive magnifications on 
a Cauchy sequence, an 
infinite sequence of points 
in X. Just because the 
points are getting closer 
and closer together as one 
looks in at higher mag- 



nification does not mean 
that there is a point x to 
which the sequence is 
converging! 



along the sequence, we must not infer that they are approaching a point. Perhaps 
they are trying to approach a point that is not there? 

Definition 3.2 A sequence {x n }'^L. l of points in a metric space (X, d) is said to 
converge to a point x eXif for any given number e > 0, there is an integer N > 0 
so that 



d(x„, x) < € 



for all n > N. 



In this case the point x eX, to which the sequence converges, is called the limit of 
the sequence, and we use the notation 

x = lim x„. 

n-±oo 

The limit x of a convergent sequence ix,,}™^ has this property: let 

B(x,€) = {y eX:d(x,y) <€} 

denote a closed ball of radius e > 0 centered at x, as illustrated in Figure 11.11. 

Any such ball centered at x contains all of the points x n after some index N, 
where N typically becomes larger and larger as e becomes smaller and smaller. See 
Figure 11.12. 

Theorem 3. 1 If a sequence of points [x n }^L l in a metric space (X, d) converges 
to a point x eX, then [x„}^L l is a Cauchy sequence. 



(a) 



(b) 





(d) 



Figure 11.11. Uncele- 
brated small ball B(x, e) 
with its center at x and 
radius €. Beware! Balls 
do not usually look like 
balls. It depends on the 
metric and on the space. 
Balls (a)-(c) represent 
balls (marked in black) in 
spaces X that are subsets 
of R 2 , with the Euclidean 
metric. In (a) X has a 
ragged boundary, viewed 
as a subset of R 2 . In (b) the 
point JC€Xis isolated. In 
(c) X is a curvaceous Sier- 
pinski triangle. The ball 
depicted in (d) is in R 2 , 
but the metric is d(x, y) = 
max { |*i - yi|, |*2 ~ yi\}. 



18 Chapter II Metric Spaces; Equivalent Spaces 



Figure 11.12. Magni- 
fying glass looking at a 
magnifying glass near a 
limit point. 




ZOOM IN 



ZOOM IN 



FOREVER 



more points of 
the set become 
) visible" as we 



zoom 



Definition 3.3 A metric space (X, d) is complete if every Cauchy sequence 
{x n }™ = \ in X has a limit xeX. 

In other words, there actually exists, in the space, a point x to which the Cauchy 
sequence is converging. This point x is of course the limit of the sequence. If {x n }™ = i 
is a Cauchy sequence of points in X and if X is complete, then there is a point x eX 
such that, for each e > 0, B(x, e) contains x n for infinitely many integers n. 

We will sometimes use the notation {x n } in place of {x n }™ =l and lim in place of 
lim„^oo when it is clear from the context what the domain of the index is. 

Examples & Exercises 

3. 1 . Prove that if {x n }™ =l is a Cauchy sequence of points in X and if X is complete, 
then there is a point x e X such that, for each € > 0, B(x, e) contains x n for infinitely 
many integers n. 

* 3.2. Show that (R, Euclidean metric) is a complete metric space. 

3.3. Show that (IR 2 , Euclidean metric) is a complete metric space. 

3.4. Show that (■, Euclidean metric) is a complete metric space. 

3.5. Show that (C, metric on sphere) is a complete metric space. 

3.6. Show that (£, code space) is a complete metric space. 

3.7. (C[0, 1], D) is a complete metric space, where the metric D is defined by 



3.8. Let (Xj, d x ) and (X 2 , d 2 ) be equivalent metric spaces. Suppose (Xj, d x ) is com- 
plete. Show that (X 2 , d 2 ) is complete. 

3.9. Show that there are many different "shortest paths" between most pairs of 
points in (■, Manhattan). 

3. 1 0. Prove Theorem 3.1. 

3.11. Prove that any sequence in a metric space can have at most one limit. 



D(f, g) = max{|/(s) - g(s)\ :se[0, 1]}. 
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Definition 3.4 Let S cXbe a subset of a metric space (X, d). A point x e X is 
called a limit point of S if there is a sequence {x n }^L l of points x n e 5 \ {x} such that 
lim^^gQ x n —X. 

Definition 3.5 Let S cXbe a subset of a metric space (X, d). The closure ofS, 
denoted S, is defined to be 5 = 5 U {Limit points of S}. S is closed if it contains all 
of its limit points, that is, 5 = 5.5 is perfect // it is equal to the set of all its limit 
points. 

Exercises & Examples 

3.12. Show that 0 is a limit of the sequence {x n = \}™ =l in the metric space ([0, 1], 
Euclidean) but not in the metric space ((0, 1], Euclidean). 

3.13. A metric space (X,d) consists of a single point X = {a}, together with a 
metric defined by d(a, a) = 0. Show that X contains a Cauchy sequence and the 
limit of the Cauchy sequence, but that it possesses no limit points. Hence show that 
X is closed and complete but not perfect. 

3.14. Show that the sequence {x n = n}™ =l has no limit in (R, Euclidean), but that it 
does when the points are treated as belonging to (C, spherical). 

3.15. Show that if h : X\ — ► X2 makes the metric spaces (X\,di) and (X 2 ,^2) 
metrically equivalent, then the statements "x e Xi is a limit point of 5 c Xf and 
"h(x) € X2 is a limit point of h(S) c X2" are equivalent. Here we use the notation 
h(S) = {h(s):s € 5}. 

3.16. Find all of the limit points of the set {x n = (£ + (-1)", £ + (-1) 2 ") : n = 
1, 2, 3, . . .} in the metric space (■ , Euclidean). 

3.17. Show that the subset 5 = {jc = £ : n = 1, 2, 3, . . .} is closed in ((0, 1], Eu- 
clidean) . 

3.18. Show that 5 = [0, 1] is a perfect subset of (R, Euclidean). 

3.19. Show that 5 = : n = 1, 2, 3, . . .} U {0} is not a perfect subset of (R, Eu- 
clidean), but that 5 = 5. 

3.20. Show that 5 = E is a perfect subset of (E, code space metric). 

3.21 . Let 5 be a subset of a complete metric space (X, d). Then (5, d) is a metric 
space. Show that (5, d) is complete if, and only if, 5 is closed in X. 



4 Compact Sets, Bounded Sets, Open Sets, and Boundaries 

We continue the description of basic properties that are to be used to describe sets 
and subsets of metric spaces. Where are the fractals? What are they? They are ev- 
erywhere and soon you will be able to see them. Not just the pictures, which are 
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shadows of fractals, but in your mind's eye you will find what and where they really 
are. 

Definition 4.1 Let S C X be a subset of a metric space (X, d). S is compact if 
every infinite sequence {jc„}^? =1 in S contains a subsequence having a limit in S. 

Definition 4.2 Let S C X be a subset of a metric space (X, d). S is bounded if 
there is a point a e X and a number R > 0 so that 

d{a, x) < RVx e X. 

Definition 4.3 Let S C X be a subset of a metric space (X, d). S is totally 
bounded if for each € > 0, there is a finite set of points {yi, y 2 , . . . , y n ) C 5 such 
that whenever x e X, d(x, y,) < e for some y,- e {yi, y 2 , • • ■ , y n }- This set of points 
{yi, >2i • • • i is called an e-net. 

Theorem 4.1 Let (X,d) be a complete metric space. Let S C X. Then S is 
compact if and only if it is closed and totally bounded. 

Proof Suppose that 5 is closed and totally bounded. Let {x t e S) be an infinite 
sequence of points in 5. Since 5 is totally bounded we can find a finite collection of 
closed balls of radius 1 such that 5 is contained in the union of these balls. By the 
Pigeon-Hole Principle (a huge number of pigeons laying eggs in two letter boxes => 
at least one letter box contains a huge number of angry pigeons), one of the balls, say 
B\, contains infinitely many of the points x n . Choose N\ so that x N] e B\. It is easy to 
see that B\ n 5 is totally bounded. So we can cover B\ D 5 by a finite set of balls of 
radius 1 /2. By the Pigeon-Hole Principle, one of the balls, say B 2 , contains infinitely 
many of the points x n . Choose N 2 so that x Nl e B 2 and N 2 > N\. We continue in this 
fashion to construct a nested sequence of balls, 

B X D B 2 D B 3 D B 4 D B 5 D B 6 D B 7 D B S D B 9 D-D B n D-- 

where B n has radius and a sequence of integers {AU^U such that x Nn e B n . It 
is easy to see that {x N J%L v which is a subsequence of the original sequence {x n }, is 
a Cauchy sequence in 5. Since 5 is closed, and X is a complete metric space, 5 is 
complete as well; see exercise 4.2. So {jc„} converges to a point x in 5. (Notice that 
jc is exactly HSli B n ) Thus, 5 is compact. 

Conversely, suppose that 5 is compact. Let e > 0. Suppose that there does not 
exist an e-net for 5. Then there is an infinite sequence of points {x„ e S) with 
d(xi, xj) >e for all i ^ j. But this sequence must possess a convergent subsequence 
{x N .}. By Theorem 3.1 this sequence is a Cauchy sequence, and so we can find a pair 
of integers N { and AT, with N { 7^ AT, so that d(x Ni , x Nj ) < e. But d(x N .,x Nj ) > €, so 
we have a contradiction. Thus there does exist an 6 -net. This completes the proof. 

Definition 4.4 Let S C X be a subset of a metric space (X, d). S is open if for 
each x e S there is an e > 0 such that B(x, e) = [y e X : d(x, y) < e} C 5. 
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Figure 11.13. A trans- 
formation 0 between two 
metric spaces, establish- 
ing the equivalence of the 
spaces and carrying the 
closed set S onto a closed 
set 0(5). 



Examples & Exercises 

4.1. Show that if (X, d) is a metric space then X is closed. Give an example of a 
metric space that is closed but not complete. 

4.2. Let S be a closed subset of a complete metric space (X, d). Show that (S, d) 
is a complete metric space. 

4.3. Let (Xi, di) and (X 2 , di) be equivalent metric spaces, and let a transformation 
9 : Xi X 2 provide this equivalence. Let 5cX ( be closed. Show that 9(S) — 
{9(s) : 5 e S} is closed. This idea is illustrated in Figure 11.13. 

4.4. If (X, d) is a metric space, then X is open. 
Proof Let* g X. Clearly B(x, 1) c X. 

4.5. If (X, d) is a metric space, then "S c X is open" is the same as "X \ S is 
closed." 

Proof Suppose "S c X is open." Suppose [x n } is a sequence in X \ S with a limit 
x g X. We must show that x e X \ S. Assume that x e S. Then every ball B(x, e) 
with e > 0 contains a point x n e X \ S, which means that S is not open. This is 
a contradiction. The assumption is false. Therefore x e X \ S. Therefore "X \ S is 
closed." 

Suppose "X \ S is closed." Let x e S. We want to show there is a ball B(x,e) C S. 
Assume there is no ball B(x, e) c S. Then for every integer n = 1, 2, 3, . . ., we can 
find a point x n e B(x, ^) D (X \ S). Clearly {x n } is a sequence in X \ S, with limit 
x g X. Since X \ S is closed we conclude that x e X \ S. This contradicts x e S. 
The assumption that there is no ball B(x, e) c S is false. Therefore there is a ball 
B(x, e) c S. Therefore "S is open." 

4.6. Every bounded subset S of (R 2 , Euclidean) has the Bolzano- Weierstrass prop- 
erty: "Every infinite sequence {x n }^ =l of points of S contains a subsequence which 
is a Cauchy sequence." The proof is suggested by the picture in Figure 11.14. 

We deduce that every closed bounded subset of (IR 2 , Euclidean) is compact. In 
particular, every metric space of the form (closed bounded subset of IR 2 , Euclidean) 
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Figure 11.14. Demon- 
stration of the Bolzano- 
Weierstrass Theorem. 
(Government warning: 
This is not a proof.) 
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is a complete metric space. Show that we can make a rigorous proof by using Theo- 
rem 4.1. Begin by proving that any bounded subset of R n is totally bounded. 

4.7. Let (X, d) be a metric space. Let / : X X be continuous. Let A be a compact 
nonempty subset of X. Show that f(A) is a compact nonempty subset of X. (This 
result is proved later as Lemma 7.2 of Chapter III.) 

4.8. Let S C (Xi, dx) be open, and let (X 2 , d 2 ) be a metric space equivalent to 
(Xi ,d\), the equivalence being provided by a function h : Xi -> X 2 . Show that h(S) 
is an open subset of X 2 . 

4.9. Let (X, d) be a metric space. Let C C X be a compact subset of X. Let {C„ : 
n = 1, 2, 3, . . .} be a set of open subsets of X such that "x e C" implies "x e C n 
for some «." {C„} is called a countable open cover of C. Show that there is a finite 
integer N so that "jc e C" implies "jc e C„ for some integer n < N" 

Proof Assume that an integer N does not exist such that "x e C" implies e C„ 
for some n < N" Then for each N we can find 
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N 

eC\\Jc n . 

Since {x N }™ =] is in C it possesses a subsequence with a limit y e C. Clearly y does 
not belong to any of the subsets C n . Hence "y e C" does not imply "y e C n for some 
integer We have a contradiction. This completes the proof. 

The following even stronger statement is true. Let (X, d) be a metric space. Let 
C c X be compact. Let {C, : / e /} denote any collection of open sets such that 
whenever x e C, it is true that x e C, for some index i e I. Then there is a finite sub- 
collection, say {Ci , C 2 , C n } such that C C UjL, c < • The P oint is that the or ig maI 

collection of open sets need not even be countably infinite. A good discussion of 
compactness in metric spaces can be found in [Mendelson 1963], Chapter V. 

4.10. Let X = (0, 1) U {2}. That is, X consists of an open interval in R, together 
with an "isolated" point. Show that the subsets (0, 1) and {2} of (X, Euclidean) are 
open. Show that (0, 1) is closed in X. Show that {2} is closed in X. Show that {2} is 
compact in X but (0, 1) is not compact in X. 

Definition 4.5 Let S czXbe a subset of a metric space (X, d). A point x e X is 
a boundary point of S if for every number € > 0, B(x, e) contains a point inX \ S 
and a point in 5. The set of all boundary points of S is called the boundary of S and 
is denoted 35. 

Definition 4.6 Let S C X be a subset of a metric space (X,d). A point x e S is 
called an interior point of S if there is a number € > 0 such that B(x, e) C S. The set 
of interior points of S is called the interior of S and is denoted S°. 

Examples & Exercises 

4.1 1 . Let S be a subset of a metric space (X, d ). Show that 35 = 3(X \ S). Deduce 
that 3X = 0. 

4.12. Show that the property of being a boundary of a set is invariant under metric 
equivalence. 

4.13. Let (X, d) be the real line with the Euclidean metric. Let S denote the set of 
all rational points in X (i.e., real numbers that can be written | where p and q are 
integers with q ^ 0). Show that 95 = X. 

4.14. Find the boundary of C viewed as a subset of (C spherical metric). 

4.15. Let S be a closed subset of a metric space. Show that 35 c 5. 

4.16. Let 5 be an open subset of a metric space. Show that 35 Pi 5 = 0. 

4.17. Let 5 be an open subset of a metric space. Show that 5° = 5. Conversely, 
show that if 5° = 5 then 5 is open. 

4.18. Let 5 be a closed subset of a metric space. Show that 5 = 5° U 35. 
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Figure 11.15. How well 
can topological concepts 
such as open, boundary, 
etc., be used to model 
land, sea, and coastlines? 




Metric Space X, the world 



The coastline is 
the boundary of 
the set called LAND 
and the set called 
SEA 

The land is the interior 
of the island. 



The wet stuff is the interior of the sea 



4. 1 9. Show that the property of being the interior of a set is invariant under metric 
equivalence. 

4.20. Show that the boundary of a set S in a metric space always divides the space 
into two disjoint open sets whose union, with the boundary dS, is the whole space. 
Illustrate this result in the following cases, in the metric space (1R 2 , Euclidean): (a) 

S = {(x, y) € R 2 : x 2 + y 2 < 1}; (b) S = K 2 . 

4.21 . Show that the boundary of a set is closed. 

^4.22. Let S be a subset of a compact metric space. Show that dS is compact. 

4.23. Figure 11.15 shows how we think of boundaries and interiors. What features 
of the picture are misleading? 

4.24. To what extent does Mercator's projection provide a metric equivalence to a 
Cartesian map of the world? 

4.25. Locate the boundary of the set of points marked in black in Figure 11.16. 

4.26. Prove the assertion made in the caption to Figure 11.17. 



5 Connected Sets, Disconnected Sets, and Pathwise-Connected Sets 

Definition 5.1 A metric space (X, d) is connected if the only two subsets ofX 
that are simultaneously open and closed are X and 0. A subset S cX is connected 
if the metric space (S, d) is connected. S is disconnected if it is not connected. S 
is totally disconnected provided that the only nonempty connected subsets of S are 
subsets consisting of single points. 

Definition 5.2 Let S C X be a subset of a metric space (X,d). Then S is 
path wise-connected if, for each pair of points x and y in S, there is a continuous 
function f : [0, 1] — >• S, from the metric space ([0, 1], Euclidean) into the metric 
space (S,d), such that f(0)=x and f(l) = y. Such a function f is called a path 
from x to y in S. S is pathwise-disconnected if it is not pathwise-connected. 
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Figure 11.16. Should 
the black part be called 
open and the white part 
closed? Locate the bound- 
ary of the set of points 
marked in black. 




Figure 11.17. The in- 
terior of the "land" set is 
an open set in the met- 
ric space (Y = mm , 
Euclidean). The smaller 
filled rectangle denotes a 
subset Z =■ of Y. The 
intersection of the inte- 
rior of the land with Z is 
an open set in the met- 
ric space (Z, Euclidean), 
despite the fact that it in- 
cludes some points of the 
"border" of ■. 
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One can also define simply connected and multiply connected. Let S be pathwise- 
connected. A pair of points x, y e S is simply connected in S if, given any two paths 
/o and /i connecting x, y in 5, we can continuously deform / 0 to f\ without leaving 
the subset S. What does this mean? 

Let there be the two points x, y e S and the two paths /o, f\ connecting x, y in 
S. In other words, /o, f\ are two continuous functions mapping the unit interval 
[0, 1] into S so that /o(0) = /i(0) = x and / 0 (l) = /i(l) = y. By a continuous 
deformation of /o into f\ within S we mean a function g continuously mapping the 
Cartesian product [0, 1] x [0, 1] into 5, so that 

(a) *(j,0) = /o(j)(0<j<1) 

(b) *(j,1) = /i(j)(0<j<1) 

(c) *(0,0=*(0<f <D 

(d) g(l,t) = y(0<t<l) 

Thus, we say that two points x, y in S are simply connected in 5 if, given any two 
paths /o, /i going from x to y in 5, there exists a function g as just described. This 
idea is illustrated in Figure 11.18. 

If x, y are not simply connected in 5, then we say that x, y are multiply connected 
in S. S itself is called simply connected if every pair of points x, y in S is simply 
connected in S. Otherwise, S is called multiply connected. In the latter case we can 
imagine that S contains a "hole," as illustrated in Figure 11.19. 

Examples & Exercises 

* 5. 1 . Show that the properties of being (pathwise) connected, disconnected, sim- 
ply connected, and multiply connected are invariant under metric equivalence. 

5.2. Show that the metric space (■, Euclidean) is simply connected. 



Figure 11.18. A path f 0 
which connects the points 
x and y is continuously 
deformed, while remain- 
ing "attached" to x and 
y, to become a second 
path 

X=g(0.t) 




fl(s)=g(s,i) 
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5.3. Show that the metric space (X = (0, 1) U {2}, Euclidean) is disconnected. 

5.4. Show that the metric space (E, code space metric) is totally disconnected. 
*5.5. Show that the metric space (^J. Manhattan) is multiply connected. 

5.6. Suppose S\ D S 2 D • • • D S n D • • • is a nested sequence of nonempty connected 
subsets. Is pl^Li S n necessarily connected? 

5.7. Identify pathwise connected subsets of the metric space suggested in Fig- 
ure 11.20. 

5.8. Is ( Euclidean) simply or multiply connected? 

5.9. Discuss which set-theoretic properties (open, closed, connected, compact, 
bounded, etc.) would be best suited for a model of a cloud, treated as a subset of 

5.10. The property that {jc„}~ 1 is a Cauchy sequence in the metric space (X, d) 
is not invariant under homeomorphism but is invariant under metric equivalence, as 
illustrated in Figure 11.21. 



6 The Metric Space Cft(X), h): The Space Where Fractals Live 

We come to the ideal space in which to study fractal geometry. To start with, and 
always at the deepest level, we work in some complete metric space such as (IR 2 , 
Euclidean) or (C, spherical), which we denote by (X, d). But then, when we wish to 
discuss pictures, drawings, "black-on-white" subsets of the space, it becomes natural 
to introduce the space H. 

Definition 6.1 Let (X, d) be a complete metric space. Then H{X) denotes the 
space whose points are the compact subsets ofX, other than the empty set. 

Examples & Exercises 

6. 1 . Show that if x and y e H(X) then x U y is in H(X). Show that xHy need not 
be in H(X). A picture of this situation is given in Figure 11.22. 

6.2. What is the difference between a subset of Ti(X) and a compact nonempty 
subset of X? 

Definition 6.2 Let (X, d) be a complete metric space, x <=X, and B e H(X). 
Define 

1 d(x,B) = min{d(x,y):y e B}. 
d(x, B) is called the distance from the point x to the set B. 
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Figure 11.19. In a mul- 
tiply connected space 
there exist paths that 
cannot be continuously 
deformed from one to an- 
other. There is some kind 
of "hole" in the space. 



Figure 11.20. Locate 
the largest connected 
subsets of this subset 
of R 2 . 
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tTfe dot at the corner 
^ '] is missing 



homeomorphism 




s-t-r-e : t-c-hTo od ~"1 
► 



Figure 11.21. ACauchy 
sequence being preserved 
by a metric equivalence 
and destroyed by a certain 
homeomorphism. 



(X,d) contains a 
Cauchy Sequence 




metric^ — 
equivalence 




(X,d) Cauchy Sequence 
destroyed ! 



(X, d) Cauchy Sequence 
survived ! 



How do we know that the set of real numbers {d(x, y) : y e B] contains a min- 
imum value, as claimed in the definition? This follows from the compactness and 
nonemptyness of the set B e H(X). Consider the function / : B -> R defined by 



From the definition of the metric it follows that / is continuous, viewed as a trans- 
formation from the metric space (B, d) to the metric space (R, Euclidean). Let 
P = inf{/(j) : y e B}, where "inf" is defined in exercise 6.19, and also in Defi- 
nition 6.2 in Chapter III. Since f(y)> 0 for all y e B, it follows that P is finite. We 
claim there is a point y e B such that d(x , y) = P. We can find an infinite sequence 
of points {y n : n = 1, 2, 3, . . .} C B such that f(y n ) - P < £ for each positive in- 
teger n. Using the compactness of B, we find that {y n : n = 1, 2, 3 . . .} has a limit 
y e B. Using the continuity of / we discover that f{y) = P, which is what we 
needed to show. 

Color Plate 2 shows a picture of the metric space (■, Manhattan). It has been 
colored as follows. Let T denote a certain subset of ■ whose "geometry" is that of a 
piece of a fern. Then the color of each point a e m is fixed by the value of d(a, T). 

Definition 6.3 Let (X, d) be a complete metric space. Let A, B e H(X). Define 



d(A, B)is called the distance from the set AeH(X) to the set B e H{X). 

Just as above, using the compactness of A and B, we can prove that this definition 
is meaningful. In particular, there are points x e A and y e B such that d(A, B) = 
d(x, >•). 

Examples & Exercises 

6.3. Show that B,C e H(X), with BcC, implies d(x, C) < d(x, B). 



f(y)=d(x,y) 



for all y e B. 



d(A, B) = max{d(x, B) : x e A}. 
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6.4. Calculate d(x, B) if (X, d) is the space (0& 2 , Euclidean), x e 0& 2 is the point 
(1, 1), and B is a closed disk of radius ~ centered at the point (j, 0). 

6.5. Same as 6.4 above, but use the Manhattan metric. 

6.6. Calculate d(x, B) if (X, d) is (R, Euclidean), x = |, and 

B = [x n = 3 + (-1)"-TTT : « = 1, 2, 3, . . .} U {3}. 
n l + 1 

6.7. Let A, B eH(X), where (X, d) is a metric space. Show that, in general, 
d(A, B) ^ d(B, A). Conclude that d does not provide a metric on H(X). It is not 
symmetrical: the distance from A to B need not equal the distance from B to A. 

6.8. Figure 11.23 shows two subsets A and B of (■ c 0& 2 , Euclidean). A is the white 
part and B is the black part, (a) Estimate the location of a pair of points, x e A and 
y e B, such that d(x, y) = d(A, B). (b) Estimate the location of a pair of points, 
x e A and y e B, such that d(x, y) — d(B, A). 

6.9. Figure 11.24 shows two fern-like subsets, A and B, of (0& 2 , Manhattan). Locate 
points x e A and y € B such that: (a) d(x, y) = d(A, B); (b) d(x, y) = d(B, A). 



Figure 11.22. Points in 
the space "H(R 2 ) may be 
interpreted as black-and- 
white images. Unions of 
points yield new points. 
Be careful with intersec- 
tions, however. 




1 The whole smiley 
I face is a point 
| in H(X). 
j Call it x e H(X). 




This thin torso 
is a point 
in H(X). 
Call it y< H(X). 




This fellow is 

x j y. 
He is a single 
point in H(X). 
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Figure 11.23. This frac- 
tal image contains a 
pair of disjoint subsets 
of ■ C K 2 , "black" and 
"white." Let A denote 
the closure of the set in 
black and let B denote 
the closure of its comple- 
ment. Find a pair of points 
x G A and y G B, such 
that d(x,y)=d(A, B). 
Find a pair of points 
x G A and y G B such 
that d(x, y) = d(B, A). 
Why do we "close" the 
sets before we begin? 



6.10. Find ^(France, U.S.A.) and d(U.S.A., France) on (C, spherical metric). 
Which is larger? Also compare ^(Georgia, U.S.A.) to d(U.S.A., Georgia). 

6.11 . Let (X, d) be a complete metric space. Let A and B be points in W(X) such 
that A # B. Show that either d(A, B) # 0 or d(B, A) # 0. Show that if A c B then 
d(A, B) = 0. 

6.12. Let (X, d) be a complete metric space. Show that if A, B and C e H{\) then 
B c C =► d(A, C) < d(A, B). (Hint: Use 6.3.) 

6.13. Let (X, d) be a complete metric space. Show that if A, B, and C <E ft(X) then 
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Figure 11.24. Find a 

pair of points x and y, 
one in the dark fern and 
one in the pale fern, such 
that the Hausdorff distance 
between the two fern 
images is the same as 
the distance between the 
points. 




d(A UB,C)= d(A, C) V d(B, C). 

We use the notation x V _y to mean the maximum of the two real numbers x and _y. 

Proof d(A U B, C) = max{^(x, C) : x € A U B] - max{^(x, C) : x e A) V 
max{d(x, C) : x € B}. 

6. 14. Let A, B, and C belong to H(X), where (X, d) is a metric space. Show that 

d(A, B)<d(A, C)+d{C, B). 
Also determine whether or not the inequality 

d(A, B) <d{C, A) + d(C, B) 

is true in general. 

Definition 6.4 Let (X, d) be a complete metric space. Then the Hausdorff dis- 
tance between points A andB in H(X) is defined by 

h(A, B) = d(A, B)vd(B, A). 



Examples & Exercises 

6. 15. Show that h is a metric on the space 7Y(X). 
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Proof Let A,B,Ce H(X). Clearly h(A, A) = d(A, A) v d(A, A) = d(A, A) = 
msoi{d(x, A) : x e A} = 0. h(A, B) = d(a, b) for some a e A and b e B, using the 
compactness of A and B. Hence 0 < h(A, B) < oo. If A / B we can assume there 
is an a e A so that a g B. It follows that h(A, B) > d(A, B) > 0. To show that 
h(A, B) < h(A, C) + h(C, B) we first show that d(A, B) < d(A, C) + d(C, B). We 
have for any a e A 

d(a,B)=min{d(a,b):beB} 

< min{d(a, c) + d(c, b) : b e B\ic e C, 
= d(a, c) + mm{d(c, b) : b e fi}Vc e C, so 
d(a, B) < min{d(a, c):ceC}+ max{min{^(c, b) : b e B} : c e C}, 

= d(a,C) + d(C,B), so 
d(A,B)=d(A,C) + d(C, B). 
Similarly, 

d(B, A) < d(B, C) + d(C, A), whence 

h(A, B) = d(A, B) v d(B, A) < d(B, C) v d(C, B) + d(A, C) v d(C, A) 
= h(B, C) + h(A, C), as desired. 

6.16. Show that h(A UB,CUD)< h(A, C) v h(B, D), for all A, B, C, and D e 
H(X). 

6.17. Show that h(A, B) = d(a, b) for some a e B and be B. 

6.18. The same situation as in 6.9, but this time locate a pair of points x e A and 
y e B such that d(x, y) = h(A, B), the Hausdorff distance from A to B. 

6.19. Let 5cK, with 5/0. The supremum of 5 is denoted by sup 5. The infinum 
of 5 is denoted by inf 5. If there is no real number greater than all of the numbers 
in 5, then sup 5 = +oo; otherwise sup 5 = min{^ e R : x > sVs e S}. If there is 
no real number less than all of the numbers in 5, then inf 5 = -oo; otherwise 
inf 5 = max {x e R : x < sWs e S}. Show that sup 5 and inf 5 are well defined. Show 
that if 5 is compact then sup 5 = max 5 and inf 5 = min 5. Further exercises on sup 
and inf are given following Definition 6.2 in Chapter III. 

By replacing max by sup and min by inf, respectively, throughout the definition 
of the Hausdorff metric, define a "distance" between arbitrary pairs of subsets of a 
metric space. Give several reasons why this "distance" is not usually a metric. 



7 The Completeness of the Space of Fractals 

We refer to (H(X), h) as the "space of fractals." It is too soon to be formal about 
the exact meaning of a "fractal." At the present stage of development of science and 
mathematics, the idea of a fractal is most useful as a broad concept. Fractals are not 
defined by a short legalistic statement, but by the many pictures and contexts that 
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refer to them. For the first eight chapters of this book, any subset of (H(X), h) is a 
fractal. However, as with the concept of a "space," more meaning is suggested than 
is formalized. 

In this section our principal goal is to establish that the space of fractals (H(X), h) 
is a complete metric space. We also want to characterize convergent sequences in 
H(X). To achieve these goals using only the tools introduced so far is quite difficult. 
Indeed, at this juncture, we want to introduce another notion; namely, the idea of 
extending certain Cauchy subsequences. 

Definition 7. 1 Let S C X and let T > 0. Then S + T = {y eX : d(x, y) < T for 
some x e S}. S + T is sometimes called, for example, in the theory of set morphol- 
ogy, the dilation of S by a ball of radius T. 

Lemma 7.1 Let A and B belong to H(X) where (X, d) is a metric space. Let 
€ > 0. Then 

h(A, B) <€ & AC B + € and BcA+e. 

Proof Begin by showing that d(A, B) < € o A c B + €. Suppose d(A, B) 
< e. Then max{d(a, B) : a e A) < € implies d(a, B) < e for all a e A. Hence 
for each a e A we have a e B + e. Hence "A c B + <?." Suppose "A c B + e." 
Consider d(A, B) = max{d(a, B) : a e A}. Let a e A. Since A c B + e, there is 
a, b e B so that d(a, b)<€ for all a e A. Hence d(a, B) < e. This is true for each 
a e A. So "d(A, B) < <?." This completes the proof. 

Let [A n : n = 1, 2, . . . , oo} be a Cauchy sequence of sets in (H(X), h). That is, 
given € > 0, there is N so that n,m>N implies 

A n + e D A m and A ra +eD A n , 

i.e., h(A n , A m ) < e. We are concerned with Cauchy sequences {* n }£Li in X with the 
property that x n e A n for each n. In particular, we need the following property that 
allows the extension of a Cauchy subsequence {x n . e A n .}JL v with the property that 
x n e A nj for each to a Cauchy sequence {x n e A n }% v 

Lemma 7.2 The Extension Lemma. Let (X, d) be a metric space. Let {A n : 

n - 1, 2, . . . , oo) be a Cauchy sequence of points in (H(X), h). Let {nn^Jli be an 
infinite sequence of integers 

0 < n\ < n 2 < «3 < • • • • 

Suppose that we have a Cauchy sequence {x nj e A n . : j = 1, 2, 3, . . .} in (X, d). 
Then there is a Cauchy sequence {x n e A n : n = 1, 2, . . .} such that x nj = x nj , for 
all j = 1,2,3,.... 

Proof We give the construction of the sequence {x n e A n : n = 1, 2, . . .}. For 
each n € {1, 2, ... , n x ), choose x n e [x e A n : d(x, x ni ) = d(x ni , A n )}. That is, x n is 
the closest point (or one of the closest points) in A n to x nr The existence of such a 
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Figure 11.25. The be- 
ginning of a Cauchy 
sequence {A n } of sets 
in H(R 2 ) is shown. A 
Cauchy subsequence of 
points {*„.} belonging to 
a subsequence of the sets 
is also indicated. Make a 
photocopy of the figure, 
and mark on it the exten- 
sion of the subsequence of 
points to the visible sets in 

{A n }. 



closest point is ensured by the compactness of A n . Similarly, for each j e {2, 3, . . .} 
and each n e {nj + I, . . . , n,+i}> choose x n e {x e A n : d{x, x n] ) = d(x nj , A n )}. 

Now we show that {x n } has the desired properties, that it is indeed an extension 
of {x nj } to {A n }. Clearly x n = x nj and x n e A n , by construction. To show that it 
is a Cauchy sequence let e > 0 be given. There is N\ so that rik, nj > N\ implies 
d(x nk , x nj ) < e/3. There is N2 so that m,n > N 2 implies 

d(A m ,A n )<€/3. 

Let TV = max{7Vi, N 2 } and note that, for m, n > N, 

d(x ) < d(x m , x nj ) + d(x nj ,x nk ) + d(x nk , x n ), 

where m e + 1, +2, . . . , rij) andn e {«/t-i + 1, «/t-i +2, . . . , rik}. Since 
h(A m , A nj ) < e/3 there exists y e A m D ({x nj } + e/3) so that d(x m , x nj ) < e/3. Sim- 
ilarly d{x nk , x n ) < e/3. Hence d(x n , x n ) < e for all m,n > N. This completes the 
proof. 

Examples & Exercises 

7.1. A Cauchy sequence {A n } of sets in (H(K 2 ), h) is sketched in Figure 11.25. The 
underlying metric space is (R 2 , Euclidean). A Cauchy subsequence {x nj e A nj } is 
also shown. Sketch, in the same figure, the extension {*„} of this subsequence to 
{A n }. 

7.2. Repeat 7.1 but this time with reference to Figure 11.26. 
The central result toward which we have been driving is this: 

Theorem 7. 1 The Completeness of the Space of Fractals. Let (X, d) 

be a complete metric space. Then (H(X), h) is a complete metric space. Moreover, 
if{A n e H(X)}' n *L l is a Cauchy sequence, then 

A = lim A n e H(X) 
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can be characterized as follows: 

A = {x e X : there is a Cauchy sequence {x n e A n ) that converges to x}. 

Proof Let {A n } be a Cauchy sequence in H(X) and let A be defined as in the 
statement of the theorem. We break the proof up into the following parts: 

(a) A # 0; 

(b) A is closed and hence complete since X is complete; 

(c) for € > 0 there is N such that for n > N, A C A n + e; 

(d) A is totally bounded and thus by (b) is compact; 

(e) lim A n = A. 

Proof of (a): We shall prove this part by proving the existence of a Cauchy se- 
quence {a, e A,} in X. Toward this end find a sequence of positive integers N\ < 
N2 < N2 < ■ ■ ■ < N n < ■ ■ ■ such that 

1 

h(A m , A n ) < for m, n > N t . 

Choose x Nl e A Nr Then since h(A Nl , A N2 ) < \, we can find an x N2 e A N2 such 
that d{x Nv x Nl ) < j. Assume that we have selected a finite sequence x Ni e A N .\ i = 
1, 2, . . . , k for which d{x Ni ^, x Ni ) < ^pr. Then since h(A Nk , A Nk+l ) < ^, and x Nk e 
A Nk , we can find x Nk+1 e A Nk+l such that d(x Nk , x Nk+i ) < ^. For example let x Nk+l be 
the point in A Nk+l that is closest to jc^. By induction we can find an infinite sequence 
{x Ni e A N .} such that d(x Nj , x Nj+i ) < To see that {x Ni } is a Cauchy sequence in X, 
let € > 0 and choose N f such that YI^n, < € • T nen for m > n > N € we have 

d(x Nm , x N J < d(x Nm , x Nm+l ) + d(x Nm+l ,x Nm+2 ) H h d(x Nn _^ x Nn ) 

00 j 

By the Extension Lemma, there exists a convergent subsequence {a, e A, } for which 
a N . = x Nj . Then lim a, exists and by definition is in A. Thus A / 0. 

Proof of (b): To show that A is closed, suppose {a, e A, } is a sequence that con- 
verges to a point a. We will show that a e A, hence making A closed. For each pos- 
itive integer /, there exists a sequence {x i<n e A n } such that lim^oo*,,,, = a,. There 
exists an increasing sequence of positive numbers {Ni}^ such that d(aN n a) < j. 



Figure 11.26. The same 
problem as for Fig- 
ure 11.25. The sets {A n } 
look very different here. 
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Furthermore, there is a sequence of integers {m,} such that d{xN utnn a Ni ) < 4. Thus 
i(jcjv,, mj , a) < 2/i. If we let y m . = x N ., mi we see that y m . e A m . and lim i -> ooy mi 
a. By the Extension Lemma, {y m .} can be extended to a convergent sequence {z, G 
A,}, and so a e A. Thus we have shown A is closed. 

Proof of (c): Let e > 0. There exists an N such that for m,n > N, h(A m , A n ) < e. 
Now let n > N. Then for m > n, A m C A n + e. We need to show that A C A n + e. 
To do this, let a G A. There is a sequence {a, G A,} that converges to a. We may 
assume N is also large enough so that for m > N, d(a m , a) < e. Then a m e A n + e 
since A m c A„ + e. Since A„ is compact, it can be shown that A n + e is closed. Then 
since a m g A n + e for all m > N, a must also be in A n + e. This completes the proof 
that A c A„ + e for n large enough. 

Proof of (d): Suppose A were not totally bounded. Then for some e > 0 there 
would not exist a finite e-net. We could then find a sequence {jc,-}^j in A such that 
d(Xi, xj) > e for i / j. We shall show that this gives a contradiction. By (c) there 
exists an n large enough so that A c A n + f . For each x ( , there is a corresponding 
)>,• € A„ for which d(xi, v ( ) < |. Since A„ is compact, some subsequence {y ni } of 
{y,} converges. Then we can find points in the sequence {y ni } as close together as we 
wish. In particular we- can find two points v„. and y Kj such that d(y nn y nj ) < |. But 
then 

€ € € 

d(x n ., x nj ) < d(y„., y ni ) + d(y„., y nj ) + d(y np x n .) < - + - + - , 

and we have a contradiction to the way {x ni } was chosen. Thus A is totally bounded 
and by part (b) compact. 

Proof of (e): From part (d), A e H(X). Hence by part (c) and Lemma 7.1 the 
proof that lim A, = A will be complete if we show that for e > 0, there exists an 
N such that for n > N, A n c A + e. To show this let e > 0 and find N such that 
for m,n> N, h(A m , A n ) < |. Then for m, n > N, A m c A„ + |. Let n > N. We 
will show that A„ c A + e. Let _y g A„. There exists an increasing sequence {jV, } 
of integers such that n < N { < N 2 < N 3 < • • • < N k < ■ ■ ■ and for m,n > Nj, A rn C 
A n + 27h- Note that A„ C A^, + f . Since v g A„, there is an G A^, such that 
d(y, x Ni ) < |. Since jc^, € A Nl , there is apointx^ 2 e A^ 2 such that *w 2 ) - ^« 

In a similar manner we can use induction to find a sequence x Ni , x Nl , x Ni , . . ., such 
that x Nj G A N . and < jTh . Using the triangle inequality a number of 

times we can show that 

d{y,x Nj )< € - for all j 

and also that {x Nj } is a Cauchy sequence. From the way n was chosen, each A Nj C 
A„ + |. converges to a point x and since A„ + | is closed, x G A„ + | also. 
Moreover, d(y,x Nj ) implies that <i(y, < e. We have thus shown that A n c 
A + e for n > N. This completes the proof that lim A n = A and consequently that 
(H(X), h) is a complete metric space. 
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Examples & Exercises 

7.3. A tree waves in the wind. A special camera photographs the tree at times t n = 

(I _ I) S ec n = 1, 2, 3 Show that with reasonable assumptions the sequence 

of pictures thus obtained form a Cauchy sequence [A n }™ =l in H(R l ). What does 
A = lim^oo A n look like? 

7.4. The Sierpinski triangle A is a compact subset of (R 2 , Euclidean). Hence 
(A, Euclidean) is a compact metric space. Given an example of an infinite set in 
(H(A, h), demonstrate a Cauchy sequence [A n e H(&)} that is contained in your 
set and describe its limit. 

7.5. Figure 11.27 shows a convergent sequence of sets in H(u) converging to a fern. 
Pick a point in A. Find a Cauchy sequence {x n e A n } that converges to it. 

*7.6. Let (X, d) be a compact metric space. Show that (W(X), h) is a compact 
metric space, where h is the Hausdorff metric on the space H(X). 

It's a good idea to get familiar with H(X), by trying to see what properties of the 
space X are also true of H(X). We have seen that completeness is one of these, and 
exercise 7.6 shows that compactness is as well. So are some forms of connectedness, 
and by way of example we prove this for path-connectedness of H(R) as follows: 

Theorem 7.2 The function f : R -> H given by f(x) = {x} is continuous. 

Proof Let {x n } be a sequence in R that converges to a point x. Then given e > 
0 there is an N such that for n > N, d(x n , x) < e. By definition, h([x n }, {x}) = 
d(x n ,x), since there is only one element in each set. Hence {{x n }} converges to {x} 
in H(R). So the function f(x) is continuous. Thus the image of R in H(R) is path- 
connected. Notice that a function like this guarantees that there will be a copy of any 
space in its associated space of non-empty compact sets. 

Theorem 7.3 The functions f x : [0, 1] -> H given by f x (a) = [x,x+ a], 0 < 
a < 1 are continuous, thereby showing that there is a path in Hfrom an interval to 
one of its endpoints. 

Proof As before, let {a n } be a convergent sequence in [0, 1] with limit a. Suppose 
d(a n , a) < e. Then the distance h([x, x + a n ], [x, x + a]) is given by 
h([x, x + a n ], [x, x +a]) = d([x, x + a n ], [x, x + a]) v d([x, x + a], [x, x + a n ]) 

= d(a, a n ) < e. 
Hence each of the functions f x is continuous. 

Theorem 7.4 If A is a compact subset of R then the function f A : [0, b] -> H 
given by f A (a) = \J[x , x + a] such that x e A is continuous. 

Proof By part (b), the function f A is continuous at each point in A, when b=\. 
Hence the statement is true for b = 1 . To get the whole statement, we note that if 
g : [0, b\ -> [0, 1] is given by g(x) = (\/b)x, g is continuous. Hence f A can be 
written as the composition of two continuous functions and is continuous. 
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Figure 11.27. ACauchy 

sequence of sets {A n } in 
the space H(K 2 ) converg- 
ing to a fern-like set. 
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Theorem 7.5 If A is a compact subset of R then the set [J[x, x + b] such that 
x € A is an interval for b large enough. 

Proof A is compact, hence it is bounded. If we take b to be the length of any 
interval [c, d] such that A c [c, d], then the set [x, x + b] with x the least element 
of A will overlap the set [y, y + b] where y is the greatest element of A. Thus all 
the sets of this form will overlap, and their union forms one interval, which is path- 
connected. 

Theorem 7.6 If A and B are compact subsets of R then there is a path in 7i 
connecting them. 

Proof For any space X and three points a, b, c e X, if there is a path from a to b 
and one from b to c then there is one from a to c. We construct a path between A and 
B in H(R) by taking first a path from A to the interval formed in part (d), namely 
choose b as in (d) and construct the path 

/ : [0, 1] -+ [0, b] -+ {f A (x) :0<x<b}, 

which is continuous by part (c). Thus there is a path between every point in 7i(R) 
and a point that is an interval. Similarly, there is a path from some interval to every 
point in H(R). By part (b) there is a path from every interval to one of its endpoints. 
By part (a) there is a path (the image of R) between any two of these endpoints. 
Therefore a path from A to B may be constructed by taking A to an interval, taking 
the interval to an endpoint, moving to the a new point of R, making a path to 
an interval, and taking the path from the interval to B. Hence H(R) is pathwise- 
connected. It is true, though somewhat more complicated, that if X is connected then 
so is H(X). 



8 Additional Theorems about Metric Spaces 

We state here a number of theorems that we shall use later on. Full proofs are not 
provided. They can be found in most introductory topology texts. We particularly 
recommend [Kasriel 1971] and [Mendelson 1963]. These theorems may be treated 
as exercises in metric space theory. 

Theorem 8.1 Let (X,d) be a metric space. Let {x n } be a Cauchy sequence 
convergent to x € X (or equivalently let {x n } be a sequence and x be a point, such 
that lim^oo d(x, x n ) = 0). Let f : X — >• X be continuous. Then 

lim f(x n ) = f(x). 

n-*oo 

Proof See your first calculus book. 

Theorem 8.2 Let (Xi, d\) and (X2, d{) be metric spaces. Let f : X\ — >• X2 be 

continuous. Let E C X\ be compact. Then /:£—>• X2 is uniformly continuous: that 
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is, given e > 0 there is a number 8 > 0 so that 

d 2 (f(x), f{y)) < € whenever d\{x, y) < 8 for all x,y e E. 

Proof Use the fact that any open cover of E contains a finite subcover. 

Theorem 8.3 Let (X, , d t ) be metric spaces for i = 1, 2, 3. Let f :X\ x X 2 -> 

X 3 have the following property. For each e > 0 there exists 8 > 0 such that 

(i) rfiUi.yi) <s=*df 3 (/(*i.*2),/(:yi,:y2)) <*, 

Vjfi, Ji € X] 
Vx 2 € X 2 

(ii) d 2 (x 2 , yi) < 8 d 3 (f(y u x 2 ), f(y\, y 2 )) < e, 

Vji e Xi 
Vx 2 , j 2 e X 2 

Then f is continuous on the metric space (X = Xi x X 2 , d), where d((xux 2 ), (y\, 
y 2 )) = max{ d x (x u y x ), d 2 (x 2 , y 2 )}. 

Proof Use 

d(f(x u x 2 ), f(yuyi)) < d(f(x u x 2 ), f(y u x 2 )) + d(f(y u x 2 ), f(y\, yi)), 

but check first that d is a metric. 

Theorem 8.4 Let (X,, d t ) be metric spaces for i = 1,2 and let the metric space 
(X, d) be defined as in Theorem 8.3. If K x <z^\ and K 2 C X 2 are compact, then 
K\ x K 2 C X is compact. 

Proof Deal with the component in K\ first. 

Theorem 8.5 Let (X,, d t ) be compact metric spaces for i = 1,2. Let f : X t ->• 
X 2 be continuous, one-to-one, and onto. Then f is a home omor phi sm. 



Chapter III 

Transformations 
on Metric Spaces; 
Contraction Mappings; 
and the Construction 
of Fractals 



1 Transformations on the Real Line 

Fractal geometry studies "complicated" subsets of geometrically "simple" spaces 
such as R 2 , C, R, and C In deterministic fractal geometry the focus is on those 
subsets of a space that are generated by, or possess invariance properties under, 
simple geometrical transformations of the space into itself. A simple geometrical 
transformation is one that is easily conveyed or explained to someone else. Usually 
it can be completely specified by a small set of parameters. Examples include affine 
transformations in R 2 , which are expressed using 2x2 matrices and 2- vectors, and 
rational transformations on the Riemann Sphere, which require the specification of 
the coefficients in a pair of polynomials. 

Definition 1 . 1 Let (X, d) be a metric space. A transformation on X is a function 
f : X — >• X, which assigns exactly one point f (x) eX to each point x e X. If S c X 
then f(S) = {f(x) : x e S}. f is one-to-one if x, y e X with f (x) = f(y) implies 
x = y. f is onto if /(X) = X. / is called invertible if it is one-to-one and onto: in 
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this case it is possible to define a transformation f 1 : X — > X, called the inverse of 
f by f~ l (y) = x, where x € X is the unique point such that y — f(x). 

Definition 1 .2 Let f: X — > X be a transformation on a metric space. The forward 
iterates of f are transformations f on : X X defined by f o0 (x) — x, f ol (x) = 
fix), f o(n+l) (x) = f o f M (x) = f(f {n) {x)) for n = 0,1,2, .... If f is invertible 
then the backward iterates of f are transformations f o{ ~ m) {x) : X — > X defined by 
f o( --»(x) = f~\x), f o{ - m \x) = {f om )-\x)form = 1, 2, 3, ... . 

In order to work in fractal geometry one needs to be familiar with the basic fam- 
ilies of transformations in R, R 2 , C, and C One needs to know well the relation- 
ship between "formulas" for transformations and the geometric changes, stretchings, 
twistings, foldings, and skewings of the underlying fabric, the metric space upon 
which they act. It is more important to understand what the transformations do to 
sets than how they act on individual points. So, for example, it is more useful to 
know how an affine transformation in R 2 acts on a straight line, a circle, or a trian- 
gle, than to know to where it takes the origin. 

Examples & Exercises 

1.1. Let f: X — > X be an invertible transformation. Show that 

jam Q j-on = fo{m+n) for a jj integers m and „ . 

1.2. A transformation / : R — > R is defined by f(x) = 2x for all x e R. Is / in- 
vertible? Find a formula for f on (x) that applies for all integers n. 

1.3. A transformation / : [0, 1] — > [0, 1] is defined by f(x) = ^x. Is this transfor- 
mation one-to-one? Onto? Invertible? 

1.4. The mapping / : [0, 1] — > [0, 1] is defined by f(x) = 4x • (1 — x). Is this trans- 
formation one-to-one? Onto? Is it invertible? 

1.5. Let C denote the Classical Cantor Set. This subset of the metric space [0, 1] 
is obtained by successive deletion of middle-third open subintervals as follows. We 
construct a nested sequence of closed intervals 

h 3 h D h D h D h D h D h D h ■ ■ ■ D In D ■ ■ ■ , 
where A 
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Figure 111.28. Con- 
struction of the Classical 
Cantor Set C. 
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/ 4 = /3 take away the middle open third of each interval in h, 



l n = I N _ X take away the middle open third of each interval in I N -\. 
This construction is illustrated in Figure III. 28. We define 

n — n 00 / 

C contains the point x = 0, so it is nonempty. In fact C is a perfect set that contains 
uncountably many points, as discussed in Chapter IV. C is an official fractal and we 
will often refer to it. 

We are now able to work in the metric space (C, Euclidean). A transformation 
f : c C is defined by f(x) = \x. Show that this transformation is one-to-one but 
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Subsets of I are transported 



Figure 111.29. The ac- 
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a real interval 
of length L / 




f ^^length |a| L 
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tion of the affine transfor- 
mation / : 1 -* 1 defined 

by f(x) = ax + b. 



rotate 180° about b 
if a is less than zero. 
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Figure 111.30. This fig- 
ure suggests a sequence of 
intervals {/„}^L 0 . Find an 
affine transformation / : 
1 -» 1 so that f°"(I 0 ) = 
I n for n = 0, 1, 2, 3, ... . 
Use a straight-edge and 
dividers to help you. 
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3 '4 



not onto. Also, find another affine transformation (see example 1.7, which maps C 
one-to-one into C). 

1.6. / : R 2 -> R 2 is defined by /(jci,jc 2 ) = (2x u x 2 +x l ) for all (x u x 2 ) € R 2 . 
Show that / is not invertible. Give a formula for f o2 (x). 

1 .7. Affine transformations in R 1 are transformations of the form f(x) = a ■ x + b, 
where a and b are real constants. Given the interval / — [0, 1], /(/) is a new interval 
of length \a\, and / rescales by a. The left endpoint 0 of the interval is moved to b, 
and /(/) lies to the left or right of b according to whether a is positive or negative, 
respectively (see Figure 111.29). 

We think of the action of an affine transformation on all of R as follows: the whole 
line is stretched away from the origin if \a\ > 1, or contracted toward it if \a\ < 1; 
flipped through 180° about O if a < 0; and then translated (shifted as a whole) by 
an amount b (shift to the left if b < 0, and to the right if b > 0). 

1 .8. Describe the set of affine transformations that takes the real interval X = [1, 2] 
into itself. Show that if / and g are two such transformations then fog and go/ 
are also affine transformations on [1, 2]. Under what conditions does / o g(X) U g o 
/(X)=X? 

1.9. A sequence of intervals {/«}^1 0 is indicated in Figure III. 30. Find an affine 
transformation / : R -> R so that / on (/o) = /„ for n = 0, 1, 2, 3, Use a straight- 
edge and dividers to help you. Also show that {I n }^L\ is a Cauchy sequence in 
(7i(R), h), where h isithe Hausdorff distance on H(R) induced by the Euclidean 
metric on R. Evaluate / = lim^oo /„. 
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Figure 111.31. Picture 
of a convergent geometric 
series in 1R 1 (see exer- 
cise 1.10). 



1.10. Consider the geometric series Yl^Lo ba n =b + ab + a 2 b + cr'b + a 4 b + 
■ ■ ■ > 0, 0 < b < 1. This is associated with a sequence of intervals 7 0 = [0, b], I n = 
/°"(/ 0 ), where f(x) = ax + b, n = 1, 2, 3, . . ., as illustrated in Figure 111.31. 

Let / = U^ 0 /„ and let / denote the total length of /. Show that /(/) = / \ 7 0 , and 
hence deduce that al = / — b so that / = b/(l — a). Deduce at once that 

oo 

^b ■ a" =b/(\ - a). 

Thus we see from a geometrical point of view a well-known result about geometric 
series. Make a similar geometrical argument to cover the case — 1 < a < 0. 

Definition 1 .3 A transformation f : R -> R of the form 

f(x) = a Q + a\x + a2X 2 + a^x 3 + • • ■ + a n x" , 

where the coefficients a l (i = 0, 1, 2, . . . , AO are real numbers, a n / 0, and N is a 
nonnegative integer, is called a polynomial transformation. N is called the degree of 
the transformation. 

Examples & Exercises 

1.11. Show that if / : R -> R and g : R -> R are polynomial transformations, then 
so is / o g. If / is of degree N, calculate the degee of f om (x) for m = 1, 2, 3, 

1.12. Show that for n > 1 a polynomial transformation f:R-> Rof degree n is not 
generally invertible. 

1.13. Show that far enough out (i.e., for large enough \x\), a polynomial transfor- 
mation / : R -> R always stretches intervals. That is, view /' as a transformation 
from (R, Euclidean) into itself. Show that if / is an interval of the form I = {x : 
|jc — a | < b] for fixed a, b e R, then for any number M > 0 there is a number ft > 0 
such that if b > ft, then the ratio (length of / (7))/(length of /) is larger than M. This 
idea is illustrated in Figure III. 32. 

1.14. A polynomial transformation /' : IR — > [R. of degree n can produce at most 
(n - 1) folds. For example / (x) = x 3 - 3.v + 1 behaves as shown in Figure III. 33. 

1.15. Find a family of polynomial transformations of degree 2 which map the inter- 
val [0, 2] into itself, such that, with one exception, if v e /([(), 2]) then there exist 
two distinct points x\ and xj in [0, 2] with f\x\) = /U2) = v. 

1.16. Show that the one-parameter family of polynomial transformations f x : 
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Figure 111.32. A poly- 
nomial transformation 
/ : [R -> [R of degree > 1 
stretches [R more and more 
the farther out one goes. 



[0, 2]-+ [0, 2], where 

fx(x)=X-x -(2-x), 



and the parameter X belongs to [0,1], indeed takes the interval [0, 2] into itself. 
Locate the value of x at which the fold occurs. Sketch the behavior of the family, 
in the spirit of Figure 111.33. 

1.17. Let / : 0& 0& be a polynomial transformation of degree n. Show that values 
of x that are transformed into fold points are solutions of 

df 
ax 

Solutions of this equation are called (real) critical points of the function /. If c is a 
critical point then f(c) is a critical value. Show that a critical value need not be a 
fold point. 

1.18. Find a polynomial transformation such that Figure 111.34 is true. 

1.19. Recall that a polynomial transformation of an interval / : / c R -> / is nor- 
mally represented as in Figure 111.35. This will be useful when we study iterates 
f on (x)™ =v However, the folding point of view helps us to understand the idea of 
the deformation of space. 

1 .20. Polynomial transformations can be lifted to act on subsets of K 2 in a simple 



triply folded region 
fold point fold point 



Figure 111.33. The 

polynomial transformation 
f(x) =jc 3 - 3x + 1. 
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Figure 111.34. Find a 

polynomial transformation 
/ : 0& -> Dfc, so that this 
figure correctly represents 
the way it folds on the real 
line. 



fold point fold point 



fold point 

i 

-r r — 



D 



Figure 111.35. The 

usual way of picturing a 
polynomial transforma- 
tion. 



critical 
value 




way: we can define, for example, F(x) = (f\(x\), f 2 (x 2 )), where f\ and f 2 are 
polynomial transformations in R, so that F : R 2 — ► R 2 . Desired foldings in two 
orthogonal directions can be produced; or shrinking in one direction and folding in 
another. Show that the transformation F(x\, x 2 ) = (f x* - yxj + y*i, x 2 ) acts on 
the triangular set S in Figure III. 36 as shown. 

The real line can be extended to a space which is topologically a circle by includ- 
ing the point at infinity. One way to do this is to think of R as a subset of C, and 



Figure 111.36. A poly- 
nomial transformation 
acting on a set S in the 
plane. 





1 


fold lines 


\ double cover 
triple cover 




1 1 1 I 1 1 1 1 

0 1 2 3 4 5 6 7 8 


( — 1 — 1 — T*t — 1 — 1 — 1 — rr 

01 2 3 '\ 5 6789 
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Figure 111.37. !Ru{oo} 
becomes a circle on a 
sphere. 



O 



then include the North Pole on C We define this space to be R = R U {oo} and will 
usually give it the spherical metric. 

Definition 1 .4 A transformation f : R R defined in the form 
ax + b 

f(x) = , a, b, c, d e R, ad ^ be, 

cx +d 

is called a linear fractional transformation or a Mobius transformation. Ifc ^ 0, then 
f(-d/c) = oo, and f(oo) = a/c. Ifc = 0, then /(oo) = oo. 

Examples & Exercises 

1.21. Show that a Mobius transformation is invertible. 

1.22. Show that if f\ and fa are both Mobius transformations then so is fa o fa. 

1.23. What does fix) = l/x do to R on the sphere? 

1.24. Show that the set of Mobius transformations / such that /(oo) = oo is the 
set of affine transformations. 

1.25. Find a Mobius transformation / : R R so that /(l) = 2, f(2) = 0, 
/(0) = oo. Evaluate /(oo). 

1.26. Figure 111.38 shows a Sierpinski triangle before and after the polynomial 
transformation x ax(x — b) has been applied to the x-axis. Evaluate the real con- 
stants a and b. Notice how well fractals can be used to illustrate how a transforma- 
tion acts. 



2 Affine Transformations in the Euclidean Plane 

Definition 2.1 A transformation w : R 2 R 2 of the form 

w(x\, x 2 ) = (axi + bx 2 + e, cx\ + dx 2 + /), 

where a, b, c, d, e, and f are real numbers, is called a (two-dimensional) affine 
transformation. 
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We will often use the following equivalent notations 



Here A = 



:)(:;)+(;)— 

( a ) is a two-dimensional, 2x2 real matrix and t is the column 
c d) 



vector \^jy which we do not distinguish from the coordinate pair (e, f) e E 2 . 

Such transformations have important geometrical and algebraic properties. From this 
point on, we shall assume that the reader is familiar with matrix multiplication. 
The matrix A can always be written in the form 

Yicos#i — r 2 sin#2 
r\ sin 0\ cos 62 



c :)-(■ 



AFTER (X,Y)^(aX (X-b),Y) 



BEFORE 

TRANSFORMATION 




Figure 111.38. A Sierpinski triangle before and after the polynomial transformation 
x 1— > qx(x — b) is applied to the x-axis. Evaluate the real constants a and b. 
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Figure 111.39. An affine 
transformation takes par- 
allelograms into parallelo- 
grams. 



-1 0 1 -1 n l 




Figure 111.40. A linear 
transformation can turn 
pictures over. 



0 



where (n, 6\) are the polar coordinates of the point (a, c) and (r 2 , (#2 + tt/2)) are 
the polar coordinates of the point (b, d). The linear transformation 

in R 2 maps any parallelogram with a vertex at the origin to another parallelogram 
with a vertex at the origin, as illustrated in Figure III. 39. Notice that the parallelo- 
gram may be "turned over" by the transformation, as illustrated in Figure 111.40. 

The general affine transformation w(x) = Ax + r in R 2 consists of a linear trans- 
formation, A which deforms space relative to the origin, as described above, fol- 
lowed by a translation or shift specified by the vector t (see Figure 111.41). 
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Figure 111.41. An affine 
transformation consists 
of a linear transformation 
followed by a translation. 



Figure 111.42. Two 

ivy leaves lying on the 
Euclidean Plane determine 
an affine transformation. 




How can one find an affine transformation that approximately transforms one 
given set into another given set in IR 2 ? Let's show how to find the affine transfor- 
mation that almost takes the big leaf to the little leaf in Figure 111.42. This figure 
actually shows a photocopy of two real ivy leaves. We wish to find the numbers 
a,b,c,d,e, and / defined above, so that 

w(BlG LEAF) approximately equals LITTLE LEAF. 
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Begin by introducing x and y coordinate axes, as already shown in Figure 111.42. 
Mark three points on the big leaf (we've chosen the leaf tip, a side spike, and the 
point where the stem joins the leaf) and determine their coordinates (jci , X2) , (y\ , y 2 ) , 
and (n, z 2 )- Mark the corresponding points on the little leaf, assuming that a cater- 
pillar hasn't eaten them, and determine their coordinates; say (x\, X2), (yi, y 2 ), an d 
(z\, I2), respectively. 

Then a, b, and e are obtained by solving the three linear equations 

X\a + X2b + e = X\, 
y x a + y 2 b + e = yu 
z\a + Z2b + e = l\, 

while c, d, and / satisfy 

X\C + X2d + f = X2, 
y\c + y 2 d + f = y 2 , 
Z\C + Z2d + / = Z2- 

Examples & Exercises 

2.1. Find an affine transformation in IR 2 that takes the triangle with vertices at 
(0, 0), (0, 1), (1,0) to the triangle with vertices at (4, 5), (-1,2), and (3, 0). Show 
what this transformation does to a circle inscribed in the first triangle. 

2.2. Show that a necessary and sufficient condition for the affine transformation 

(: :)(;;M;H« 

to be invertible is det A / 0, where det A = (ad — be) is the determinant of the 2 x 2 
matrix A. 

2.3. Show that if f x : R 2 -> IR 2 and f 2 : R 2 -> R 2 are both affine transformations, 
then so is 

h = /1 0 h- 

If fi(x) = AiX + ti,i = 1, 2, 3, where A, is a 2 x 2 real matrix, express A3 in terms 
of A\ and A 2 . 

2.4. Let A and B be 2 x 2 matrices, with determinants det A and detfi, respec- 
tively. Show that the determinant of the product is the product of the determinants, 
i.e., 

det(Afi) = det A • det B. 
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Definition 2.2 A transformation w : R 2 — > R 2 is called a similitude it is an 

affine transformation having one of the special forms 

(x\ \ _ ( r cos 0 — r sin 0 \ ( x\ \ ( e \ 
X2 J \ r sin 0 r cos 6 J \X2 J \f J 

(jci\_/rcos# rsinO \ / xi \ / e \ 
*2 ) \ r sin 6 —r cos 6 ) \X2 ) \f J 

for some translation (e, f) e R 2 , some real number r^O, and some angle 9,0 < 
6 < 2tt . 0 is called the rotation angle while r is called the scale factor or scaling. 
The linear transformation 



is a rotation. The linear transformation 



is a reflection. 



^ fx\\_^rcosO — rs\nd\/x\\ 
6 \*2 ) \rsin0 rcos# ) \X2 ) 
nsformation 

•O-G -■)(::) 



Figure 111.43 shows some of the things a similitude can do. Notice that a simili- 
tude preserves angles. 

Examples & Exercises 

2.5. Find the scaling ratios r\, r2 and the rotation angles Q\, 62 for the affine trans- 
formation that takes the triangle (0, 0), (0, 1), (1, 0) onto the straight-line segment 
from (1, 1) to (2, 2) in R 2 in such a way that both (0, 1) and (1,0) go to (1, 1). 

2.6. Let S be a region in R 2 bounded by a polygon or other "nice" boundary. Let 
w : R 2 — ► R 2 be an affine transformation, w(x) = Ax + t. Show that 

(area of w(S)) = \ det A| • (area of S); 

see Figure 111.44. Show that det A < 0 has the interpretation that S is "flipped over" 
by the transformation. (Hint: suppose first that S is a triangle.) 

2.7. Show that if w : R 2 — > R 2 is a similitude, w(x) = Ax + t, where t is the trans- 
lation and A is a 2 x 2 matrix, then A can always be written either A = rR e or 
A = rRR 0 . 

2.8. View the railway tracks image in Figure 111.45 as a subset S of R 2 . Find a 
similitude w : R 2 R 2 such that w(S) C S, w(S) # S. 

2.9. We use the notation introduced in Definition 2.2. Find a nonzero real number 
r, an angle 0, and a translation vector t such that the similitude wx = rRex + t on 
R 2 obeys 

iu(A) C A, with iu(A) ^ A, 
where A denotes a Sierpinski triangle with vertices at (0, 0), (1, 0), and (\, 1). 
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2.10. Show that if w : R 2 -> R 2 is affine, w(x) = Ax + 1, then it can be reexpressed 

•«=(s :)(;:)- 

where r,- e R and 0 < 0 < 2tt. We call a transformation of the form 

■(:)-(: ".)(;;) 

a coordinate rescaling. 

2. 1 1 . Let S denote the two-dimensional orchard subset of R 2 shown in Fig- 
ure 111.46. Find two fundamentally different affine transformations that map S into 
S but not onto S. Define the transformations by specifying how they act on three 
points. 

2.1 2. Show that if A is a 2 x 2 matrix such that det A ^ 0, with 

:)• 

then the inverse of A, denoted A 1 ,is given by 
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Figure 111.44. The scal- 



ing factor by which an 
affine transformation 
changes area is deter- 
mined by the determinant 
of its linear part. 




Figure 111.45. Railway 
to infinity. Can you find an 
affine transformation that 
nearly maps the track ties 
into themselves? 
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Figure 111.46. Orchard subset of R 2 . Can you find some interesting affine transformations 
that map this set into itself? 



2. 1 3. The trace of a matrix A is the sum of the elements along the diagonal, that is 

tr A = ^ an . 

Let A be a 2 x 2 matrix, and let B be another 2x2 matrix such that det B / 0. Show 
that 

tr(BAB~ l ) =trA 

and 

det(BAB~ l ) = det A. 

2.14. Let w(x) = Ax denote a linear transformation in the metric space (R 2 , D) 
where 
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«-(: :)■ 

Define the norm of a point x e 0& 2 to be |jc | = D(x, O), where 0 denotes the origin. 
Define the norm of the linear transformation A by 

\A\ = max | :xgR 2 ,x#o) 

when this maximum exists. Show that \A \ is defined when D is the Euclidean metric 
and when it is the Manhattan metric. Find an expression for \A\ in terms of a, b, c, 
and d in each case. Make a geometrical interpretation of \A\. Show that when \A\ 
exists we have 

\Ax\ < \A\ ■ \x\ for all x e R 2 . 



3 Mobius Transformations on the Riemann Sphere 

Definition 3. 1 A transformation f : C -> C defined by 

(az + b) 



/(z) = 



(cz + d) ' 



where a, b, c, and d e C, ad — be / 0, is called a Mobius transformation on C. // 
c / 0 then f(-d/c) — oo, and /(oo) = a/c. If c = 0, then /(oo) = oo. 

As shown by the following exercises and examples, one can think of a Mobius 
transformation as follows. Map the whole plane C, together with the point at infin- 
ity, onto the sphere C, as described in Chapter II. A sequence of operations is then 
applied to the sphere. Each operation is elementary and has the property that it takes 
circles to circles. The possible operations are rotation about an axis, rescaling (uni- 
formly expand or contract the sphere), and translation (the whole sphere is picked 
up and moved to a new place on the plane, without rotation). Finally, the sphere is 
mapped back onto the plane in the usual way. Since the mappings back and forth 
from the plane to the sphere take straight lines and circles in the plane to circles on 
the sphere, we see that a Mobius transformation transforms the set of straight lines 
and circles in the plane onto itself. We also see that a Mobius transformation is in- 
vertible. It is wonderful how the quite complicated geometry of Mobius transforma- 
tions is handled by straightforward complex algebra, where we simply manipulate 
expressions of the form (az + b)/(cz + d). 

Examples & Exercises 

3. 1 . Show that the most general Mobius transformation, which maps oo to oo, is of 
the form f(z) — az + b, a, b e C, a / 0, and that this is a similitude. Show that any 
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Figure 111.47. A Mobius transformation acting on England to produce a new country. 

two-dimensional similitude that does not involve a reflection can be written in this 
form. That is, disregarding changes in notation, 

f(z) = f(xi + ix 2 ) = (a\ + ia 2 )(x\ + ix 2 ) + (b x + ib 2 ) 

= re ie {x x + ix 2 ) + (b x + ib 2 ), (i = V^T) 

(r cos 6 — r sin 6 \ ( x\ \ / b\ \ 
r sin 9 r cos 9 ) \ x 2 J \b 2 ) ' 

Find r and 9 in terms of a\ and a 2 . Show that the transformation can be achieved as 
illustrated in Figure 111.48. 

3.2. Show that the Moljius transformation f(z) = l/z corresponds to first mapping 
the plane to the sphere in such a way that the unit circle [z e C : \z\ = 1} goes to 
the equator, followed by an inversion of the sphere (turn it upside down by rotating 
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about an axis through +1 and -1 on the equator), and finally mapping back to the 
plane. 



3.3. Show that any Mobius transformation that is not a similitude may be written 

/ 

f(z) = e-\ for some e, f, e e C, f ^ 0 

z + g 

3.4. Sketch what happens to the picture in Figure 111.49 under the Mobius transfor- 
mation f(z) = \. 

3.5. What happens to Figure 111.49 under the Mobius transformation f(z) = 1 + i z l 
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(-1. o) 



□ 

(0. 0) 


□ 







(0. -1) 




Figure 111.49. Up the 

Garden Path. What does 
the Mobius transformation 
z \-> 1 + iz do to this 
picture? 



3.6. Show algebraically that a Mobius transformation / : C -> C is always invert- 
ible. 

3.7. Show that if f\ and f 2 are Mobius transformations, then /i o f 2 is a Mobius 
transformation. 

3.8. Find a Mobius transformation that takes the real line to the unit circle centered 
at the origin. 

3.9. Evaluate f on (z) if f(z) = 1/(1 + z),ne {-2, -1, 0, 1, 2, 3, . . .}. 

3.10. Interpret the Mobius transformation f(z) = i + l/(z - 0 in terms of opera- 
tions on the sphere. 



4 Analytic Transformations 

In this section we continue the discussion of transformations on the metric spaces 
(C, Euclidean) and (C, Spherical). We introduce a generalization of the Mobius 
transformations, called analytic transformations. We concentrate on the behavior 
of quadratic transformations. It is recommended that during a first reading or first 
course the reader obtains a good mental picture of how the quadratic transformation 
acts on the sphere. The reader may then want to study this section more closely after 
reading about Julia sets in Chapter VII. 
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The similitude / : C -> C defined by the formula f(z) = 3z + 1 is an example 
of an analytic transformation. It maps circles to circles magnified by a factor of 
three. A disk with center at zo is taken to a disk with center at f(zo) = 3zo + 1. The 
tranformation is continuous, and it maps open sets to open sets. Nowhere does it 
"fold back along the dotted line." 

The similitude / : C -> C defined by f(z) = (3 + 3i)z + (1 - 2i) is similarly de- 
scribed. The circles and disks are now rotated by 45° in addition to being magnified 
and translated. 

Loosely a transformation on C is analytic if it is continuous and it locally "be- 
haves like" a similitude. If you take a very small region indeed (How small? Small 
enough! There is a smallness such that what is about to be said is true!) and you 
watch what the transformation does to that tiny region, you will typically find that 
it is magnified or shrunk, rotated, and translated, in almost exactly the same manner 
that some similitude would do the job. The similitude will always be of the special 
type discussed in exercise 3.1 above. 

We make this description more precise. Let us decide to look at what our transfor- 
mation does in the vicinity of a point zo e Assume that zo is not a critical point, 
defined below. Let T denote a tiny region, a disk for example, which contains the 
point zo- Let f(T) be its image under the transformation. Then one can rescale T by 
a factor that makes it roughly the size of the unit square, and one can rescale f(T) 
by the same factor. The assertion of the previous paragraph is that the action of the 
transformation, viewed as taking T, rescaled, onto f(T), rescaled, can be described 
more accurately by a similitude. If you like, one could consider a picture P drawn 
in T and examine the transformed image f(P): if P and f(P) are rescaled by the 
same factor so that P is the size of the unit square, then f(P) looks more and more 
like a similitude applied to /'. This description becomes more and more precise the 
tinier the region under discussion. 

Consider the quadratic transformation / : C -> C defined by 

f(z) = z 2 = Oi + ix 2 ) 2 = (x 2 - x\) + 2x x x 2 i = fi(x u x 2 ) + f 2 (x u x 2 )i, 

where f\(x\, x 2 ) = (x 2 — x\) is called the real part of f(z), and f 2 (x u x 2 ) = 2x x x 2 
is called the imaginary part of /. Pictures of what this transformation does to some 
Sierpinski triangles in C are illustrated in Figure 111.50. 

Two features are to be noticed. (I) Provided that we stay away from the origin, 
the transformation behaves locally like a similitude: for points z close to zo , f(z) is 
approximated by the similitude 

w(z) =az + b where a = 2zo and b = —zq. 

This fact shows up in Figure 111.50: upon close examination (we suggest the use of a 
magnifying glass here) of the transformed Sierpinski triangles, one sees that they are 
built up out of small triangles whose shapes are only slightly different from that of 
their preimages. The only place where this is not true is at the forward image of the 
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Figure 111.50. Quadrat- 
ic transformations are 
described by showing how 
they act on a Sierpinski 
triangle. Use a magnifying 
glass to check that the 
transformations behave 
locally like similitudes. 



origin, which is a critical point. (II) The transformation maps the space twice around 
the origin. 

One can track analytically what happens to the point 

z = Rcost + iR sinf, 

where R > 0. As the time parameter t goes from zero to In, z moves anticlockwise 
once around the circle of radius R. The transformed point f(z) is given by 

f( z ) = R 2 cos2t + iR 2 s'm2t. 

As the time parameter t goes from 0 to 2n, f(z) moves twice around the circle of 
radius R 2 . 

On the Riemann sphe/e the transformation z h+ z 2 can be described as follows. 
Let us say that the Equator corresponds to the circle of unit radius in the plane, that 
the South Pole corresponds to the Origin, and that the North Pole corresponds to the 
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Point at Infinity. Then the transformation leaves both Poles fixed. The Line of Longi- 
tude L connecting the Poles, which corresponds to the positive real axis, is mapped 
into itself, and the Equator is mapped into itself. Here is what we must picture. First, 
points that lie above the Equator are moved closer to the North Pole; points that lie 
below the Equator are moved closer to the South Pole; and the Equator is not shifted. 
Second, the skin of the sphere is cut along the Line of Longitude L. One side of the 
cut is held fixed while the other side is pulled around the sphere (following the ter- 
minator when the Sun is high above the Equator), uniformly stretching the space, 
until the edge of the cut is back over L. The two lips of the the cut are rejoined. 
The sphere has been mapped twice over itself. The Poles are the critical points of the 
transformation; they are the points about which wrapping occurs. This description is 
illustrated in Figure 111.51. 

The most general quadratic transformation on the sphere is expressible by a for- 
mula of the form f(z) = Az 2 + Bz + C, where A, B, and C are complex numbers. 



Figure 111.51. The ac- 

tion of the quadratic trans- 
formation z i->- z 2 in terms 
of the sphere. 




(1) POINTS ABOVE 
THE EQUATOR MOVE 
CLOSER TO THE 
NORTH POLE; BELOW 
THEY MOVE SOUTH. 



(2)THE SPHERE IS CUT 
ALONG THE LINE OF 
LONGITUDE L. 



(3) ONE EDGE OF THE 
CUT IS PULLED RIGHT 
AROUND THE SPHERE. 
THE SPHERE IS 
COVERED TWICE. 



S 
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Figure 111.52. The set 

valued inverse, /', of 
the quadratic transforma- 
tion f(z) = z 2 , maps the 
Sierpinski triangle AOB 
into POQ U POQ. More 
generally, maps the 
Space of Fractals into it- 
self. Look carefully at this 
image! Several important 
features of analytic trans- 
formations are illustrated 
here. 



One can show there is a change of coordinates, z*-+0(z), where 6 is a similitude, 
such that f(z) becomes expressible in the special form f(z) = z 2 + C for some 
complex number C; see Exercise 5.20 in the following section. Hence the descrip- 
tion of the most general quadratic transformation on the sphere can be made in the 
same terms as above, except that at the end there is a translation by some constant 
amount C. This translation leaves the Point at Infinity fixed. 

The quadratic transformation f(z) = z 2 maps the punctured plane C onto it- 
self twice. Each point on z e C{0} has two preimages. Hence / : C C is not 
an invertible transformation. In such situations we can define a set-valued inverse 
function. 

Definition 4.1 Let f : C C be an analytic transformation such that /(C) = 
C. Then the set-valued inverse of f is the mapping f~ l : H(Q H(Q defined by 

f~\A) = {we€: f(w) e A} for all A e H(X). 

In Figure 111.52 we illustrate the transformation f~ l acting on the Space of Frac- 
tals, in the case of the quadratic transformation f(z) = z 2 . 
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One can obtain explicit formulas for f^ 1 (z) when f is a quadratic transfor- 
mation. For example for f(z) = z 2 , f~ l (0) = O, / _1 (oo) = oo, and f~ 1 (z) = 
{wi(z), u>2(z)} for z e C \ {0, oo}. Here w \(x\ + 1x2) = a(x\,X2) + ib(x\, X2) , and 
W2(x\, X2) = — a(x\, X2) — ib(x\, X2), where 



a(x\, X2) = 



a(x\, X2) = 



b(x u x 2 ) = 



N 



-^1 — f" Xj ~ f" -^1 



N 



X \ ~ f~ X*y ~ f~ X\ 



when JC2 > 0, 



when X2 < 0, 



2 + x 2 

1 ^ A 2 



*1 



Each of the two functions w\(z) and W2(z) is itself analytic on C \ {0, 00}. 

The following definition formalizes what is meant by an analytic transformation 
on the complex plane. We recommend further reading, for example [Rudin, 1966]. 

Definition 4.2 Let (C, d) denote the complex plane with the Euclidean metric. 
A transformation f : C -» C is called analytic if for each zo £ C r/zere z's a similitude 
of the form 



w(z) = az + b, 



for some pair of numbers a, b e <C, 



smc/i that d(f(z), w(z))/d(z, zo) -» 0 as z -» zo- TVie numbers a and b depend on zo- 
If, corresponding to a certain point zo = c, we have a = 0, c w called a critical 
point of the transformation, and f(c) is called a critical value. 

If the analytic transformation f(z) is a rational transformation, which means that 
it is expressible as a ratio of two polynomials in z, such as 

(i)/(z) = 1 + 2i +27z 2 -9z\ 
l + z 



(ii) /(z) = 

(iii) /(z) = 



1 -z 

l + z + z 2 



l-z + z 2, 

then the numbers a and b in the similitude w (z) in Definition 4.2 are given by the 
formulas 

a = f'(zo) and b = f(zo) ~ azo- 

The derivative f'(z) of the rational function f(z) can be calculated by treating z as 
though it were the real variable x and applying the standard differentiation rules of 
calculus. The critical points c e C are the solutions of the equation f'(c) = 0. 
For example, close enough to any point zo e C such that f'(zo) # 0, the cubic 
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transformation (i) is well described by the similitude 

w(z) = (54z 0 - 21z%)z + (1 + 2i - 27z 0 2 + 18z 0 3 ). 
The finite critical points associated with (i) may be obtained by solving 

54c - 27c 2 = 0 

and are accordingly c = 0 + iO and c = 2 + iO. By making the change of coordinates 
z' = l/z (see section 5), one can also analyze the behavior near the point at infinity. 
It turns out that c = oo is always a critical point for a polynomial transformation 
f(z) on C. The space is "wrapped" an integral number of times about the image 
of critical point. For example, the cubic transformation (i) wraps space twice about 
each of the points /(0 + i'O) = 1 + 2i, and f(2 + iO) = 37 + 2i, and it wraps it three 
times about /(oo) — oo. 



Examples & Exercises 

4.1. Sketch a globe representing C, including a subset that looks like Africa, and 
show what happens to the subset under the quadratic transformation f(z) = z 2 - 

4.2. Verify the following explicit formulas for f~ 1 (z), corresponding to f(z) = 
z 2 -\: f~ l (-l) = O; f' l (oo) = oo; and f~ l (z) = {u>i(z), w 2 (z)} for z e C \ 
{-1, oo}, where w i(x\ + ix?) = a. (x\ , x-i) + ib{x\,X2), and 1^2(^1, ^2) = —a{x\,X2) 
- ib(x\, X2). Here 



a(x\ , X2) 



N 



^/(l +x x ) 2 + xl + 1 + Xi 



a(x\, X2) = — 



N 



y<i +x x ) 2 +xi + 1 + xi 



when X2 > 0, 



when X2 < 0, 



and 



b(x u x 2 ) = 



^/(l +Xi) 2 + X 



1 — X\ 



Both w i(z) and W2(z) are analytic on C \ {— 1}. 

4.3. Locate the critical points and critical values of the quadratic transformation 

f(z) = z 2 +l. 

4.4. Draw a side view of a man with an arm stretched out in front of him, holding 
a knife. The blade shojild point down. Choose the origin of coordinates to be his 
navel. Draw another picture to explain how hara-kiri can be achieved by applying 
the inverse of the quadratic transformation f(z) = z 2 to your image. 
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4.5. Find a similitude that approximates the behavior of the given analytic transfor- 
mation in the vicinity of the given point: (a) f(z) = z 2 near zo = 1; (b) f(z) = l/z 
near zo = 1 + i; (c) f(z) = (z - l) 3 near z 0 = 1 - i. 



5 How to Change Coordinates 

In describing transformations on spaces we usually make use of an underlying coor- 
dinate system. Most spaces have a coordinate system by means of which the points 
in the space are located. This underlying coordinate system is implied by the specifi- 
cation of the space: for example, X = [1, 2] provides a collection of points together 
with the natural coordinate x restricted by / < x < 2. We can think of either the 
space, made of points x g X, or equivalently the system of coordinates. If the space 
X is R 2 or C, then the underlying coordinate system may be Cartesian coordinates. 
If X = C, then the coordinate system may be angular coordinates on the sphere. 

In each case the underlying coordinate system is itself a subset of a metric space. 
We denote this metric space by X c . Usually we do not consciously distinguish 
between a point x eX and its coordinate x eX c . Notice, however, that the space 
X c may contain points (coordinates) that do not correspond to any point in the space 
X. For example, in the case of the space X = m it is natural to take X c = R 2 ; then 
points x e X in the space correspond to coordinates x — (x\, X2) g X c restricted by 
0 < x\ < 1 and 0 < xi < 1. However, the coordinates (3, 5) g X c do not correspond 
to a point in X. We would like the reader to think of the space itself as "lying above" 
its coordinate system, as suggested in Figure 111.53. 

A change of coordinate system may be described by a transformation 9 : Xc — >• 
X c . We can think of a change of coordinates being effected by physically moving 
each point jcgXso that it no longer lies above x eX c but instead above the coordi- 
nate x' = 0(x) g X c . Thus we must now distinguish between a point x lying in the 
space, X, from its coordinate x e Xc- Then we want to think of the change of coor- 
dinates 0 : X c -> X c as moving X relative to the underlying coordinate space X c , 
as illustrated in Figure 111.54. 

Example 

5.1. Let X = [1, 2] and take X c to be R. Let 9 : R -+ R be defined by G(x) = 
2x + 1. Then the coordinate of the point x = 1.5 becomes changed to 4. We want 
to think of the space X as being moved relative to the coordinate space X c , which is 
held fixed , as illustrated in Figure 111.55. 

Let 0 : X c -> X c denote a change of coordinates. In order that the new coordinate 
system be useful, it is usually necessary that 9, treated as a transformation from X 
to #(X), be one-to-one and onto, and hence invertible. Let / : X -> X be a transfor- 
mation on a metric space X. We want to consider how the transformation / should 
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be expressed after the change of coordinates. Let x denote simultaneously a point 
in X and the coordinates of that point. Let f(x) denote simultaneously the point to 
which x is transformed by /, and the coordinates of that point. Let x' denote the 
point x e X in the new coordinate system. That is, x' = &(x) € X c denotes the new 
coordinates of the point x. Let /'(*') denote the same transformation / : X -> X, 
but expressed in the new coordinate system. Then the relation between the two co- 
ordinate systems is expressed by the commutative diagram in Figure 111.57, and is 
illustrated in Figure III. 56. 

Theorem 5.1 Let X be a space and let X c ~D\be a coordinate space for X. 
Let a change of coordinates be provided by a transformation 6 :X C -» X c . Let 6 be 
invertible when treated as a transformation from X to 0(X). Let the coordinates of 
a point x 6 X be denoted by x before the change of coordinates, and by x' after the 
change of coordinates, so that 

x' = 0(x). 

l et f : X ^ X be a transformation on the space X. Let x h* f{x) be the formula 
for f expressed in the original coordinates. Let x' h* f'(x') be the formula for f 
expressed in the new coordinates. Then 
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f(x) = (e- l ofoe)(x), 

f{x') = {6ofo6- l )(x'). 



Examples & Exercises 

5.2. Consider an affine transformation f{x) — ax + b, a 7^ 0, a 7^ 1, a,b e\S&. This 
has a fixed point Xf e R defined by /(*/) = Xf. We find Xf — b/(l — a). Xf is 
clearly the interesting point in the action of an affine transformation on K. Accord- 
ingly let us change coordinates to move x/ to the origin: that is x' = 6(x) = x — Xf. 
What does / look like in this new coordinate system? 

f\x') = (0ofo e- l )(x') = 0o(x' + x f )= a{x' + Xf ) + b- x f ; 

f'(x') = ax', which is simply a rescaling! Now using the first formula we get 



Figure 111.55. A x e(x) 

change of coordinates 

for the space [1,2] given _^ | | ( ( 

by the transformation - 1 _ _ 

x' = 9(x) = 2x + 1. 
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Figure 111.56. The 

transformation F acting 
on X is equivalent to F' 
acting on 6(X). 



f(x) = a(x — Xf) + Xf 

and 

f°»( x )=a n (x-Xf) + x f forall«e{0, 1,2,3,...}. 

We now see a new way of visualizing an affine transformation on R: for example, if 
a > 1, we see the image in Figure 111.58. 

5.3. Show that for any affine transformation f(x) : E 2 -» M 2 given by f(x) = 
Ax + t, with fixed point x /, that the coordinate transformation 6(x) — x — xj trans- 
forms the function f'(x') = Ax'. 

5.4. Let X = [1 , 2] and let a change of coordinates be defined by x' — 2x— 1 . Let a 
transformation / : X X be defined by f(x) = (x — l) 2 + 1 . Express /' in the new 
coordinate system. 

Definition 5.1 Let f : X — ► X be a transformation on a metric space. A point 
Xf e X such that f(xf) =x/ is called a fixed point of the transformation. 

The fixed points of a transformation are very important. They tell us which parts 
of the space are pinned in place, not moved, by the transformation. The fixed points 
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of a transformation restrict the motion of the space under nonviolent, nonripping 
transformations of bounded deformation. 

Examples & Exercises 

5.5. Find the fixed points x\ and X2 of the Mbbius transformation 

(z + 2) 



/(*) = 



(4-z) 



on C. Make a change of coordinates so that Jti becomes the origin and X2 becomes 
the point at infinity. Hence interpret the action of f(z) on the sphere in geometrical 
terms. 

5.6. Let W(x) = Ax + t where det A ^ 0 is a two-dimensional affine transforma- 
tion acting on the space X = E 2 . Find the fixed point Xf. Change coordinates so 
that Xf becomes the origin of coordinates. Hence describe the action geometri- 
cally of a two-dimensional, nondegenerate affine transformation. What can happen 
if det A = 07 

5.7. Suppose we can find a coordinate transformation BAB~ l = D, where D is a 
diagonal matrix we denote by 



Figure 111.57. Com- 
mutative diagram for 
the coordinate change 

0 '. Xc — > X(;. 




ORIGINAL 
COORDINATES 



NEW 

COORDINATES 



Figure 111.58. An affine 
transformation on K.. We 
see rescaling (magnifica- 
tion or diminution) cen- 
tered at the fixed point, 
together with a flip of 180° 
if a < 0. 
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\0 -X 2 J 
Show that A.i and X2 satisfy the equation 

-(•;' 

5.8. Analyze the behavior of the affine transformation w{z) = lz + 1 on € near the 
point at infinity by making the change of coordinates h{z) = l/z- 

5.9. Two one-parameter families of transformations on R are f^{x) = x 2 — /x and 
gx(x) = kx(l - x), where /x and X are real parameters. Find a change of coordinates 
and a function /x = so that f^ix') = gx(x') is valid for an appropriate interval 
on the X-axis. 

5.10. Find the real fixed points of g(x) — x 2 - \. Analyze the behavior of g near 
each of its fixed points by changing coordinates so as to move first one then the other 
to the origin. Another method for looking at the behavior of g near a fixed point is 
to approximate g(x) by the first two terms of its Taylor series expansion about the 
fixed point. Compare these methods. 

5.11. Suppose that the 2 x 2 matrix 

'■(: 0 

satisfies the condition (tr A) 2 - 4 det A > 0. Show that there is a B such that 

BAB 1 = D, 

where D is a diagonal matrix. Furthermore, show that one choice for B is given by 

'< i> 

What do you think happens if (tr A) 2 - 4 det A < 0? 

5.12. Let w : IR 2 E 2 denote the affine transformation 

Make a change of coordinates so that the transformation is simply a coordinate 
rescaling. What are the rescaling factors? 

Definition 5.2 Let F denote a set of transformations on a metric space X. F 
is called a semigroup if f g e F implies f o g e F. F is called a group if it is a 
semigroup of invertible transformations, and f e F implies f' x e F 

We introduce this definition because we will use semigroups (and groups) of 
transformations both to characterize and to compute fractal subsets of X. However, 
we do not use any deep theorems from group theory. 



e-X 



f 
h-X 



X 2 - tr AX + det A = 0. 
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Examples & Exercises 

5. 1 3. Let / : X — > X be a transformation on a metric space. Show that the set of 
transformations {/ on : n=0, 1, 2, 3, . . .} forms a semigroup. 

5.14. A transformation T : E ->> E on code space is defined by 

T {X\X2X 2X4X5 . . .) = . . . 

and is called a shift operator. Describe the semigroup of transformations {7"°" : n = 
0, 1 , 2, 3, . . .}. What are the fixed points of T° 3 if the code space is built up from the 
two symbols {0, 1}? 

5. 15. Show that the set of Mobius transformations on R forms a group. 

5. 1 6. Show that the set of Mobius transformations on C forms a group. 

5. 1 7. Show that the set of invertible affine transformations on R 2 forms a group. 

5.18. Show that the set of transformations / : R 2 — ► R 2 such that /(A) c A forms 
a semigroup. 

5.19. Show that a group of transformations is provided by the set of affine trans- 
formations of the form w(x) = Ax + t, where A = ( f ^ | for a, b, c e R, with 
ac 7^ 0, and the translation vector t is arbitrary. 

5.20. The most general analytic quadratic transformation / : C — > C can be ex- 
pressed by a formula of the form f(z) = Az 2 + Bz + C, where A, B, and C are 
complex numbers, and A ^ 0. Show that by means of a suitable change of coordi- 
nates, z' = 6(z), where 6 is a similitude, show that f(z) can be reexpressed as a 
quadratic transformation of the special form f'(z) = {z') 2 + C for some complex 
number C. 



6 The Contraction Mapping Theorem 

Definition 6.1 A transformation f : X — > X on a metric space (X, d) is called 
contractive or a contraction mapping if there is a constant 0 < s < 1 such that 

d(f(x),f(y))<s.d(x,y)Vx, y e X. 

Any such number s is called a contractivity factor for f . 

It would be convenient to be able to talk about the largest number and the smallest 
number in a set of real numbers. However, a set such as S = (—00, 3) does not 
possess either. This difficulty is overcome by the following definition. 

Definition 6.2 Let S denote a set of real numbers. Then the infimum of S is 

equal to —00 if S contains negative numbers of arbitrarily large magnitude. Other- 
wise the infimum of S = max{;c e R : x < s for all 5 e S}. The infinum of S always 
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exists because of the nature of the real number system, and it is denoted by inf S. 
The supremum of S is similarly defined. It is equal to +00 if S contains arbitrarily 
large numbers; otherwise it is the minimum of the set of numbers that are greater 
than or equal to all of the numbers in S. The supremum of S always exists, and it is 
denoted by sup S. 

Examples & Exercises 

6.1. Find the supremum and the infimum of the following sets of real numbers: (a) 
(-00, 3); (b) C, the Classical Cantor Set; (c) {1, 2, 3, 4, . . .}; (d) the positive real 
numbers. 

6.2. Let / : X X be a contraction mapping on a compact metric space (X, d). 
Show that inf {5 e R : s is a contractivity factor for /} is a contractivity factor for /. 

6.3. Show that if / : X X and g : X X are contraction mappings on a space 
(X,d), with contractivity factors s and t, respectively, then / o g is a contraction 
mapping with contractivity factor st. 

Theorem 6. 1 [(The Contraction Mapping Theorem).] Let f :X^Xbe a con- 
traction mapping on a complete metric space (X, d). Then f possesses exactly one^ 
fixed point x f e X and moreover for any point xeX, the sequence {f on (x) : n = 
0, 1, 2, . . .} converges tox f . That is, 

lim f° n (x)=x f , for each x eX. 

n-+oo 

Figure 111.59 illustrates the idea of a contractive transformation on a compact 
metric space. 

Proof Let x e X. Let 0 < s < 1 be a contractivity factor for /. Then 

d(f on (x), f om (x)) < s mAn d(x, / 0| "- m| )(x) (1) 

for all m, n = 0, 1, 2, . . ., where we have fixed x e X. The notation u a v denotes 
the minimum of the pair of real numbers u and v. In particular, for k = 0, 1, 2, . . ., 
we have 

d( X , r k (x)) < d( X , fix)) + (fix), r 2 ix)) + ■■■+ dir^ l \x), r k ix)) 

<(l +s + s 2 + ...+s k - l )dix,fix)) 
<(\-sy l d(x,f(x)), 
so substituting into equation (1) we now obtain 

d(f on (x), f om (x)) < s mAn ■ (1 - s)- 1 ■ (d(x, f(x)), 

from which it immediately follows that {/°"(*)}~ 0 is a Cauch y sequence. Since X 
is complete this Cauch? sequence possesses a limit x f e X, and we have 

lim f on (x)=x f . 
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Figure 111.59. (a) Illus- 
trates the idea of a con- 
tractive transformation on 
a metric space, (b) A con- 
traction mapping doing 
its work, drawing all of a 
compact metric space X 
toward the fixed point. 




(a) 



(b) 




Now we shall show that x f is a fixed point of /. Since / is contractive it is continu- 
ous and hence 

/(*,) = /(lim f on (x))= lim f*» +l \x)=x f . 

n—>-oo n—>oo 

Finally, can there be more than one fixed point? Suppose there are. Let x / and y f be 
two fixed points of /. Then x/ — f(x/), yj = /(>>/), and 

d(x f , y f ) =d(f(x f ), /(*/)) < sd(x f , y f ), 
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where (1 - s)d(x f , y f ) < 0, which implies d(x f , y/) = 0 and hence x f = y f . This 
completes the proof. 

Examples & Exercises 

6.4. Let w(x) = Ax + 1 be an affine transformation in two dimensions. Make the 
change of coordinates h(x) = x' = x - x f , under the assumption that det(7 — A) ^ 
0, and show that w'(x') = h o w o h~ l (x') = Ax', that w(x) = (h~ l o w' o /»)(*) = 
A(x — */) + x/, and hence that 

w on (x) = A n (x -x f )+Xf forn =0,1,2,3,.... (2) 

Give conditions on A such that it is contractive (a) in the Euclidean metric, and (b) in 
the Manhattan metric. Show that if | A\ < 1, where \A \ denotes any appropriate norm 
of A viewed as a linear operator on a two-dimensional vector space, then {w on (x)} 
is a Cauchy sequence that converges to Xf, for each xel 2 . 

6.5. Let /:■->■ be a contraction mapping on (■, Euclidean). Show that Fig- 
ure 111.59 gives the right idea. 

6.6. Let / : R -> R be the affine transformation f(x) = \x + \. Verify / is a 
contraction mapping and deduce 

lim f on (x) = x f for each x e R. 

n— >-oo 

Use this formula with x = 1 to obtain a geometrical series for the fixed point x f e R. 
Observe, however, f(R) = R; indeed / is invertible. 

6.7. Let (X, d) be a compact metric space that contains more than one point. Show 
that the situation in exercise 6.6 cannot occur for any contraction mapping / : X -> 
X. That is, show that /(X) C X but /(X) # X. That is, show that a contraction 
mapping on a nontrivial compact metric space is not invertible. Hint: use the com- 
pactness of the space to show that there is a point in the space that is farthest away 
from the fixed point. Then show that there is a point that is not in /(X). 

6.8. Show that the set of contraction mappings on a metric space forms a semi- 
group. 

6.9. Show that the affine transformation w : A -> A defined by w(x) — Ax + t is a 
contraction, where 

/>sl20° -isinl20°X 
\\ sin 120° \ cos 120°/ \ 0 / 
Here A is an equilateral Sierpinski triangle with a vertex at the origin and one at 
(1,0). You need to begin by verifying that w does indeed map A into itself! Locate 
the fixed point x f . Make a picture of this contraction mapping "doing its work, 
mapping all of the compact metric space A toward the fixed point." Use different 
colors to denote the successive regions / o(M) (A) \ / o(M+1) (A) for n — 0, 1, 2, 3, . . .. 
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Figure 111.60. The ex- 
istence of a positive 
eigenvalue of an "angle- 
squeezing" linear transfor- 
mation. 




6. 10. Define a mapping on the code space of two symbols {0, 1} by /(xiX2*3*4 • • •) 
= \x\X2X3X4 — (Recall that the metric is d(x, y) = YlHi |JC 'y y ' 1 , or equivalent.) 
Show that / is a contraction mapping. Locate the fixed point of /. 

6.1 1. Let (X, d) be a compact metric space, and let / : X — > X be a contraction 
mapping. Show that {/°"(X)}^1 0 is a Cauchy sequence of points in (H(X), h) and 
lim„^oo f on (X) = {xf}, where xj is the fixed point of /. 

6.12. Let (X, d) be a compact metric space. Let / : X — > X have the property 
lim„^oo f on (X) = Xf. Find a metric d on X such that / is a contraction mapping, 
and the identity is a homeomorphism from (X, d) — > (X, d). 

6.13. Let Ax — ^ (jc' ) w ^ a 'b,c,d e IR, all strictly positive, be a lin- 
ear transformation on U 2 . Show that A maps the positive quadrant {(x\, x 2 ) : *i > 
0, x 2 > 0} into itself. Let a mapping / : [0, 90°] -> [0, 90°] be defined by 

A /cos0\ , . . , . /cos/(0)\ 

A . „ = (some positive number) I . _ . 
Vsin^y \smf(6)J 

Show that {f on (0)} converges to the unique fixed point of /. Deduce that there 

(cos 6 \ 
1 = 
sm8 ) 

XI C ° S f ).See Figure 111.60. 
\ sin 0 ) 
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7 Contraction Mappings on the Space of Fractals 

Let (X, d) be a metric space and let (H(X), h(d)) denote the corresponding space 
of nonempty compact subsets, with the Hausdorff metric h(d). We introduce the 
notation h(d) to show that d is the underlying metric for the Hausdorff metric h. 
For example, we may discuss (H(Q, h (spherical)) or (H(R 2 ), h (Manhattan)). We 
will drop this additional notation when we evaluate Hausdorff distances. 

We have repeatedly refused to define fractals: we have agreed that they are sub- 
sets of simple geometrical spaces, such as (0& 2 , Euclidean) and (<C, Spherical). If 
we were to define a deterministic fractal, we might say that it is a fixed point of a 
contractive transformation on (H(X), h(d)). We would require that the underlying 
metric space (X, d) be "geometrically simple." We would require also that the con- 
traction mapping be constructed from simple, easily specified, contraction mappings 
on (X, d), as described below. 

Lemma 7.1 Let w.X^-X be a contraction mapping on the metric space 
(X,d). Then w is continuous. 

Proof Let e > 0 be given. Let s > 0 be a contractivity factor for w. Then 

d(w (x), w(y)) < sd(x, y) < e 

whenever d {x, y) < 8, where 8 = e/s. This completes the proof. 

Lemma 7.2 Let w : X X be a continuous mapping on the metric space 
(X, d). Then w maps H(X) into itself. 

Proof Let S be a nonempty compact subset of X. Then clearly w(S) = {w (x) : 
jc G S} is nonempty. We want to show that w (S) is compact. Let {y n = w(x n )} be 
an infinite sequence of points in S. Then {x n } is an infinite sequence of points in S. 
Since S is compact there is a subsequence {x N J that converges to a point x e S. But 
then the continuity of w implies that [y Nn = f(x N J} is a subsequence of {y n } that 
converges to y = f(x) e w (S). This completes the proof. 

The following lemma tells us how to make a contraction mapping on (H(X), h) 
out of a contraction mapping on (X, d). 

Lemma 7.3 Let w.X -> X be a contraction mapping on the metric space 
(X, d) with contractivity factor s. Then w : H(X) -> H(X) defined by 

W (B) = {w(x) : x g B}VB e H(X) 

is a contraction mapping on (H(X), h{d)) with contractivity factor s. 

Proof From Lemma 7.1 it follows that w : X -> X is continuous. Hence by 
Lemma 7.2 w maps H(X) into itself. Now let B, C e H(X). Then 

d(w(B), w(C)) = max{min{d(w(x, y), w(y)) : y e C) : x e B} 

< max{min{s • d(x, y) : y e C) : x e B} = s ■ d(B, C). 
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Similarly, d(w(C), w(B)) < s ■ d(C, B). Hence 

h(w(B), w(C)) = d(w(B), w(C)) v d(w(C), w(B)) < s ■ d(B, C) v d(C, B) 
<s -d(B, C). 
This completes the proof. 

The following lemma gives a characteristic property of the Hausdorff metric 
which we will shortly need. The proof follows at once from exercise 6. 1 3 of Chapter 
II. 

Lemma 7.4 For all B, C, D, and E, in H(X) 

h(B U C, D U E) < h(B, D) v h(C, E), 

where as usual h is the Hausdorff metric. 

The next lemma provides an important method for combining contraction map- 
pings on (H(X),h) to produce new contraction mappings on (H(X),h). This 
method is distinct from the obvious one of composition. 

Lemma 7.5 Let (X, d) be a metric space. Let {w n : n = 1,2, . . . , N} be con- 
traction mappings on (H(X), h). Let the contractivity factor for w n be denoted by s n 
for eachn. Define W :H(X)^ H(X) by 

W(B) = w x {B) U w 2 (B) U . . . U w n (B) 

= U n n=] w n (B), for each B e H(X). 

Then W is a contraction mapping with contractivity factor s = max{s n : n = 1, 2, 
...,N}. 

Proof We demonstrate the claim for N = 2. An inductive argument then com- 
pletes the proof. Let B, C e H(X). We have 

h(W(B), W(C)) = h(w x (B) U w 2 (B), w x (C) U w 2 {C)) 

< h(w x (B), w x {C)) v h{w 2 {B), w 2 (C)) (by Lemma 7.2) 

< s x h(B, C) v s 2 h{B, C) < sh(B, C). 
This completes the proof. 

Definition 7.1 A (hyperbolic) iterated function system consists of a complete 
metric space (X, d) together with a finite set of contraction mappings w n : X -> 
X, with respective contractivity factors s n , for n = 1,2, . . . , N. The abbreviation 
"IFS" is used for "iterated function system." The notation for the IFS just an- 
nounced is {X; w n , n = \,2, . . . , N] and its contractivity factor is s = max{5„ : n = 

i,2,..., yv}. 

We put the word "hyperbolic" in parentheses in this definition because it is some- 
times dropped in practice. Moreover, we will sometimes use the nomenclature "IFS" 
to mean simply a finite set of maps acting on a metric space, with no particular con- 
ditions imposed upon the maps. 

The following theorem summarizes the main facts so far about a hyperbolic IFS. 
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Theorem 7.1 Let {X; w„, n = 1,2, . . . , N} be a hyperbolic iterated function 
system with contractivity factor s. Then the transformation W : H(X) -> H(X) de- 
fined by 

W(B) = U n n=l w n (B) 

for all B g H(X), is a contraction mapping on the complete metric space (H(X), 
h(d)) with contractivity factor s . That is 

h(W(B), W(C))<s -h(B, C) 

for all B,C e H(X). Its unique fixed point, A e H(X), obeys 

A = W(A) = U n n=l w n (A) 

and is given by A = lim^oo W on (B)for any B e H{X). 

Definition 7.2 The fixed point A e H(X) described in the theorem is called the 
attractor of the IFS. 

Sometimes we will use the name "attractor" in connection with an IFS that is 
simply a finite set of maps acting on a complete metric space X. By this we mean 
that one can make an assertion analagous to the last sentence of Theorem 7.1 . 

We wanted to use the words "deterministic fractal" in place of "attractor" in 
Definition 7.2. We were tempted, but resisted. The nomenclature "iterated function 
system" is meant to remind one of the name "dynamical system." We will introduce 
dynamical systems in Chapter 4. Dynamical systems often possess attractors, and 
when these are interesting to look at they are called strange attractors. 

Examples & Exercises 

7.1. This exercise takes place in the metric spaces (R, Euclidean) and (H(R), 
h (Euclidean)). Consider the IFS {R; w\, wj), where w\{x) = ^x and W2(x) — \x + 
|. Show that this is indeed an IFS with contractivity factor s = |. Let B 0 = [0, 1]. 
Calculate B n = W on (B 0 ), n = 1, 2, 3, . . .. Deduce that A = lim^oo B n is the classi- 
cal Cantor set. Verify directly that A = |AU{|A + |}. Here we use the following 
notation: for a subset A of R, xA = {xy : y e A} and A + x = {y + x : y e A}. 

7.2. With reference to example 7.1, show that if w\{x) — s\x and W2(x) = (1 — 
s\)x + s\, where s\ is a number such that 0 < s\ < 1, then B\ = Z?2 = #3 — ■ ■ •■ Find 
the attractor. 

7.3. Repeat example 7.1 with w\(x) — |jc and w 2 (x) — ^x + \. In this case A = 
lim^oo B n will not be the classical Cantor set, but it will be something like it. 
Describe A. Show that A contains no intervals. How many points does A contain? 

7.4. Consider the IFS {R; \x + |, \x, \x + |}. Verify that the attractor looks like 
the image in Figure 111.61. Show, precisely, how the set in Figure 111.61 is a union of 
three "shrunken copies of itself." This attractor is interesting: it contains countably 
many holes and countably many intervals. 
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Figure 111.61. Attractor 
for three affine maps on 
the real line. Can you find 
the maps? 




Figure 111.62. A se- Ay 

quence of sets converging • 
to a line segment. 
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7.5. Show that the attractor of an IFS having the form {R; w\ (x) =ax +b, w 2 (x) = 
cx + d}, where a, b, c, and d e R, is either connected or totally disconnected. 

7.6. Does there exist an IFS of three affine maps in R 2 whose attractor is the union 
of two disjoint closed intervals? 

7.7. Consider the IFS 



Let A 0 = {(|, y) : 0 < y < 1}, and let W on (A Q ) = A n , where W is defined on H(R 2 ) 
in the usual way. Show that the attractor is A = {(x, y) : x — y,0 < x < 1} and 
that Figure 111.62 is correct. Draw a sequence of pictures to show what happens if 
A 0 = {(*, y) € R 2 : 0 < x < 1, 0 < y < 1}. 

7.8. Consider the attractor for the IFS {R; wi(x) = 0, w 2 (x) = \x + |}. Show that 
it consists of a countable increasing sequence of real points {x n : n = 0, 1,2,...} 
together with {1}. Show that x n can be expressed as the nth partial sum of an infinite 
geometric series. Give a succinct formula for x n . 

7.9. Describe the attractor A for the IFS {[0, 2]; w { (x) = \x 2 , w 2 (x) = \x + \ ) by 
describing a sequence of sets which converges to it. Show that A is totally discon- 
nected. Show that A is perfect. Find the contractivity factor for the IFS. 

7.10. Let (r, 6), 0 < r < oo, 0 < 0 <2n denote the polar coordinates of a point 
in the plane, R 2 . Define w x {r, 0) = {\r + \, \0), and w 2 (r, 0) = (f r + \,\0 + 
^f). Show that {R 2 ; w\, w 2 } is not a hyperbolic IFS because both maps w\ and w 2 
are discontinuous on the whole plane. Show that {R 2 ; w\, w 2 } nevertheless has an 
attractor; find it (just consider r and 0 separately). 

7. 11. Show that the sequence of sets illustrated in Figure 111.63 can be written in 
the form A n = W on (A 0 ) forn = 1, 2, . . ., and find W : H(R 2 ) -> H(R 2 ). 

7. 1 2. Describe the collection of functions that constitutes the attractor A for the IFS 



Find the contractivity factor for the IFS. 

7.13. Let C°[0, 1] = {/ € C[0, 1] : /(0) = f(l) = 0}, and define d(f, g) - 
max{|/(x) - g(x)\ :xe[0, 1]}. Define w x : C°[0, 1] -> C°[0, 1] by = 
\f(2x mod 1) + 2x(l - x) and (w 2 (f))(x) = \f{x). Show that {C°[0, 1]; w u w 2 ) 
is an IFS, find its contractivity factor, and find its attractor. Draw a picture of the 
attractor. 

7.14. Find conditions Such that the Mobius transformation w{x) = {ax + b)/(cz + 
d),a, b, c,d e C, ad - k ^ 0, provides a contraction mapping on the unit disk 




0 x - 1 



Figure 111.63. The first 
three sets A 0 , A u and A 2 
in a convergent sequence 
of sets in Ti(K 2 ). Can 
you find a transformation 
W : H(R 2 ) -+ H(R 2 ) 
such that A n+l = W(A„)1 
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X = {zeC:|z|<l}. Find an upper bound for the contractivity factor. Construct an 
IFS using two Mobius transformations on X, and describe its attractor. 

7. 15. Show that a Mobius transformation on C is never a contraction in the spheri- 
cal metric. 

7.16. Let (E, d) be the code space of three symbols {0, 1, 2}, with metric 

OO I i 

d(x,y) = ^— — . 

n=\ 

Define w\ : E -> £ by w\(x) = Ox 1X2X2 . . . and wjix) = 2*1*2*3 — Show that w\ 
and W2 are both contraction mappings and find their contractivity factors. Describe 
the attractor of the IFS {£; w\, W2}. What happens if we include in the IFS a third 
transformation defined by w^x = Ix 1X2X3 . . .? 

7.17. Let A c 0& 2 denote the compact metric space constisting of an equilateral 
Sierpinski triangle with vertices at (0, 0), (1, 0), and (5, and consider the IFS 
{A, + \, \e 2ni/3 z + \) where we use complex number notation. Let Aq = A, 
and A n = W on (A 0 ) for n = 1, 2, 3, . . .. Describe A u A 2 , and the attractor A. What 
happens if the third transformation w-s(z) = \z + \ + (\/3/4)i is included in the 
IFS? 



8 Two Algorithms for Computing Fractals from Iterated Function Systems 

In this section we take time out from the mathematical development to provide two 
algorithms for rendering pictures of attractors of an IFS on the graphics display de- 
vice of a microcomputer or workstation. The reader should establish a computer- 
graphical environment that includes one or both of the software tools suggested in 
this section. 

The algorithms presented are (1) the Deterministic Algorithm and (2) the Random 
Iteration Algorithm. The Deterministic Algorithm is based on the idea of directly 
computing a sequence of sets {A n = W on (A)} starting from an initial set A 0 . The 
Random Iteration Algorithm is founded in ergodic theory; its mathematical basis 
will be presented in Chapter IX. An intuitive explanation of why it works is pre- 
sented in Chapter IV. We defer important questions concerning discretization and 
accuracy. Such questions are considered to some extent in later chapters. 

For simplicity we restrict attention to hyperbolic IFS of the form {R 2 ; w n : n = 
1, 2, . . . , N}, where each mapping is an affine transformation. We illustrate the al- 
gorithms for an IFS whose attractor is a Sierpinski triangle. Here's an example of 
such an IFS: 

[xxl [0.5 0 1U1 Til 
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w 2 



Wt, 







"0.5 


0 " 






x 2 _ 




0 


0.5 _ 




_x 2 


Xl~ 




"0.5 


0 " 




'xi 


X2 




0 


0.5 




X2 



+ 



+ 



1 

50 

'25 
50 



This notation for an IFS of affine maps is cumbersome. Let us agree to write 

a,- bi x\ 
Ct d t x 2 



Wj(x) = Wi 



x\ 

X2 



+ 



= AiX + ti. 



Then Table III. 1 is a tidier way of conveying the same iterated function system. 

Table III. 1 also provides a number p t associated with io,- for i = 1, 2, 3. These 
numbers are in fact probabilities. In the more general case of the IFS {X; w n : n = 
1, 2, . . . , N}, there would be N such numbers {p, - : i = 1 , 2, . . . , N} that obey 

P\ + Pi + P3 H 1- p n = 1 and p t > 0 for i = 1 , 2, . . . , N. 

These probabilities play an important role in the computation of images of the at- 
tractor of an IFS using the Random Iteration Algorithm. They play no role in the 
Deterministic Algorithm. Their mathematical significance is discussed in later chap- 
ters. For the moment we will use them only as a computational aid, in connection 
with the Random Iteration Algorithm. To this end we take their values to be given 
approximately by 

| det Ai\ Widi - biCi\ 



for / = 1,2, ...,N. 



Here the symbol & means "approximately equal to." If, for some /, det A, = 0, 
then pi should be assigned a small positive number, such as 0.001 . Other situations 
should be treated empirically. We refer to the data in Table III. 1 as an IFS code. 
Other IFS codes are given in Tables III.2, III.3, and III.4. 

Algorithm 8.1 The Deterministic Algorithm. Let {X; w u w 2 , w N } be a 

hyperbolic IFS. Choose a compact set A 0 C Then compute successively A n = 
W° n (A) according to 

A n+ i=U n j=l Wj(A n ) for/i = 1, 2, .... 

Thus construct a sequence {A n : n = 0, 1, 2, 3, ...} C H(X). Then by Theorem 7.1 
the sequence {A n } converges to the attractor of the IFS in the Hausdorff metric. 



Table III. 1 . IFS code for a Sierpinski triangle. 



w 


a 


b 


c 


d 


e 


/ 


P 


1 


0.5 


0 


0 


0.5 


1 


1 


0.33 


2 


0.5 


0 


0 


0.5* 


1 


50 


0.33 


3 


0.5 


0 


0 


0.5 


50 


50 


0.34 
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Table III. 2. IFS code for a square. 



w 


a 


b 


c 


d 


e 


/ 


P 


1 


0.5 


0 


0 


0.5 


1 


1 


0.25 


2 


0.5 


0 


0 


0.5 


50 


1 


0.25 


3 


0.5 


0 


0 


0.5 


1 


50 


0.25 


4 


0.5 


0 


0 


0.5 


50 


50 


0.25 



Table 111.3. IFS code for a fern. 



w 


a 


b 


c 


d 


e 


/ 


P 


1 


0 


0 


0 


0.16 


0 


0 


0.01 


2 


0.85 


0.04 


-0.04 


0.85 


0 


1.6 


0.85 


3 


0.2 


-0.26 


0.23 


0.22 


0 


1.6 


0.07 


4 


-0.15 


0.28 


0.26 


0.24 


0 


0.44 


0.07 



Table 111.4. IFS code for a fractal tree. 



w 


a 


b 


c 


d 


e 


/ 


P 


1 


0 


0 


0 


0.5 


0 


0 


0.05 


2 


0.42 


-0.42 


0.42 


0.42 


0 


0.2 


0.4 


3 


0.42 


0.42 


-0.42 


0.42 


0 


0.2 


0.4 


4 


0.1 


0 


0 


0.1 


0 


0.2 


0.15 



We illustrate the implementation of the algorithm. The following program com- 
putes and plots successive sets A n+ \ starting from an initial set Aq, in this case a 
square, using the IFS code in Table III. 1 . The program is written in BASIC. It should 
run without modification on an IBM PC with Color Graphics Adaptor or Enhanced 
Graphics Adaptor, and Turbobasic. It can be modified to run on any personal com- 
puter with graphics display capability. On any line the words preceded by a ' are 
comments and not part of the program. 

Program 1 . (Example of the Deterministic Algorithm) 
screen 1 : els 'initialize graphics 

dim s( 100, 100) : dim t( 100, 100) 'allocate two arrays of pixels 
a(l)=0.5:b(l)=0:c(l)=0:d(l)=0.5:e(l)=l:f (1)=1 'input the IFS code 
a(2)=0.5:b(2)=0:c(2)=0:d(2)=0.5:e(2)=50:f (2)=1 
a(3)=0.5:b(3)=0:c(3)=0:d(3)=0.5:e(3)=25:f (3)=50 
for i=l to 100 'input the initial set A(0) , in this case 
a square, into the array t(i,j) 
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t(i,l)=l: pset(i,l) 'A(O) can be used as a condensation set 
t(l,i)=l:pset(l,i) 'A(O) is platted on the screen 
t(100,i)=l:pset(100,i) 
t(i,100)=l:pset(i, 100) 
next : do 

for i=l to 100 'apply W to set A(n) to make A(n+1) in the 

array s(i,j) 
for j=l to 100 : if t(i,j)=l then 

s(a(l)*i+b(l)*j+e(l) ,c(l)*i+d(l)*j+f (1))=1 'and apply W to A(n) 
s(a(2)*i+b(2)*j+e(2) ,c(2)*i+d(2)*j+f (2))=1 
s(a(3)*i+b(3)*j+e(3) ,c(3)*i+d(3)*j+f (3) )=1 
end if : next j : next i 

els 'clears the screen — omit to obtain sequence with a A(0) as 

condensation set (see section 9 in Chapter II) 
for i=l to 100 : for j=l to 100 
t(i, j)=s(i, j) 'put A(n+1) into the array t(i,j) 
s(i,j)=0 'reset the array s(i,j) to zero 
if t(i,j)=l then 
pset(i,j) 'plot A(n+1) 
end if : next : next 

loop until instat 'if a key has been pressed then stop, 
otherwise compute A(n+l)=W(A(n+l) ) 

The result of running a higher-resolution version of this program on a Masscomp 
5600 workstation and then printing the contents of the graphics screen is presented 
in Figure 111.64. In this case we have kept each successive image produced by the 
program. 

Notice that the program begins by drawing a box in the array t(i, j). This box has 
no influence on the finally computed image of a Sierpinski triangle. One could just 
as well have started from any other (nonempty) set of points in the array t(i, j), as 
illustrated in Figure 111.65. 

To adapt Program 1 so that it runs with other IFS codes will usually require 
changing coordinates to ensure that each of the transformations of the IFS maps the 
pixel array s(i, j) into itself. Change of coordinates in an IFS is discussed in exercise 
10.14. As it stands in Program 1, the array s(i, j) is a discretized representation of 
the square in R 2 with lower left comer at (1, 1) and upper right comer at (100, 100). 
Failure to adjust coordinates correctly will lead to unpredictable and exciting results! 

Algorithm 8.2 The Random Iteration Algorithm. Let {X; w u w 2 ,..., w N } 
be a hyperbolic IFS, where probability p ( > 0 has been assigned to to u>,- for i = 
1, 2, . . . , N, where Xw=i A = 1- Choose x 0 e X and then choose recursively, inde- 
pendently, 

x n e {wi(x„-{), w 2 (x„-i), . . . , w N (x n -i)} for n = 1, 2, 3, ... , 
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where the probability of the event x n = Wi(x„-i) is p t . Thus, construct a sequence 
{x n :n=0, 1,2,3, ...} c X. 

★ The reader should skip the rest of this paragraph and come back to it after 
reading Section 9. If {X, w 0 , w u w 2 , w N } is an IFS with condensation map w 0 
and associated condensation set C C H(X), then the algorithm is modified by (a) 
attaching a probability p 0 > 0 to w 0 , so now J2"=o p { = 1; (b) whenever w 0 (x n -i) 
is selected for some n, choose x n "at random" from C. Thus, in this case too, we 
construct a sequence {x n : n = 0, 1, 2, . . .} of points in X. 

The sequence {*„}£i 0 "converges to" the attractor of the IFS, under various con- 
ditions, in a manner that will be made precise in Chapter IX. 

We illustrate the implementation of the algorithm. The following program com- 
putes and plots a thousand points on the attractor corresponding to the IFS code in 
Table III. 1 . The program is written in BASIC. It runs without modification on an 



Figure 111.64. The re- 
sult of running the De- 
terministic Algorithm 
(Program 1) with various 
values of N, for the IFS 
code in Table III. 1 . 
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IBM PC with Enhanced Graphics Adaptor and Turbobasic. On any line the words 
preceded by a ' are comments: they are not part of the program. 

Program 2. (Example of the Random Iteration Algorithm) 
'Iterated Function System Data 



a[l] 


= 0 


5 : b[l] 


=0 


c[l] 


=0 


d[l] 


= .5 


e[l] =1 : 


f[l] =1 


a [2] 


= 0 


5 : b[2] 


=0 


c[2] 


=0 


d[2] 


= .5 


e[2] =50 


: f[2] =1 


a [3] 


= 0 


5 : b[3] 


=0 


c[3] 


=0 


d[3] 


= .5 


e[3] =50 


: f[3] =50 



screen 1 : els 'initialize computer graphics 

window (0,0) -(100, 100) 'set plotting window to 0<x<100, 0<y<100 
x =0 : y = 0: numits =1000 'initialize (x,y) and define 
the number of iterations, numits 




Figure 111.65. The re- 
sult of running the Deter- 
ministic Algorithm (Pro- 
gram 1), again for the IFS 
code in Table III.l, but 
starting from a different 
initial array. The final re- 
sult is always the same! 
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Figure 111.66. The result of running the Random Iteration Algorithm for increasing 
numbers of iterations. The randomly dancing point starts to suggest the structure of the 
attractor of the IFS given in Table III.3. 



for n =1 to numits 'Random Iteration begins! 

k = int (3*rnd-0 . 00001) +1 'choose one of the numbers 1, 2, 
and 3 with equal probability 

'apply affine transformation number k to (x,y) 

newx =a[k] *x+b[k] *y+e [k] : newy =c [k] *x+d [k] *y+f [k] 

x =newx : y =newy 'set (x,y) to the point thus obtained 

if n > 10 then pset (x,y) 'plot (x,y) after the first 10 

iterations 
next : end 

The result of running an adaptation of this program on a Masscomp workstation 
and then printing the contents of the graphics screen is presented in Figure 111.66. 
Notice that if the size of the plotting window is decreased, for example by replacing 
the window call by WINDOW (0,0)-(50,50), then only a portion of the image is 
plotted, but at a higher resolution. Thus we have a simple means for "zooming in" 
on images of IFS attractors. The number of iterations may be increased to improve 
the quality of the computed image. 

Examples & Exercises 

8. 1 . Rewrite Programs 1 and 2 in a form suitable for your own computer environ- 
ment, then run them and obtain hardcopy of the output. Compare their performance. 

8.2. Modify Programs 1 and 2 so that they will compute images associated with the 
IFS code given in Table III.2. 
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8.3. Modify Program 2 so that it will compute images associated with the IFS codes 
given in Tables III. 3 and III.4. 

8.4. By changing the window size in Program 2, obtain images of "zooms" on the 
Sierpinski triangle. For example, use the following windows: (1,1)- (50, 50); (1, 
1) - (25, 25); (1,1)- (12, 12); . . . ; (1, 1) - (N, N). How must the total number of 
iterations be adjusted as a function of N in order that (approximately) the number 
of points that land within the window remains constant? Make a graph of the total 
number of iterations against the window size. 

8.5. What should happen, theoretically, to the sequence of images computed by 
Program 1 if the set A 0 is changed? What happens in practice? Make a computa- 
tional experiment to see if there is any difference in say Ai 0 corresponding to two 
different choices for A 0 . 

8.6. Rewrite Program 2 so that it applies the transformation w t with probabil- 
ity p h where the probabilities are input by the user. Compare the number of it- 
erations needed to produce a "good" rendering of the Sierpinski triangle, for the 
cases (a) Pl = 0.33, p 2 = 0.33, p 3 = 0.34; (b) p x = 0.2, p 2 = 0.46, p 3 = 0.34; (c) 
Pi =0.1, p 2 = 0.56, p 3 =0.34. 



9 Condensation Sets 

There is another important way of making contraction mappings on H(X). 

Definition 9.1 Let (X, d) be a metric space and let C e H(X). Define a trans- 
formation w 0 : H(X) -> H(X) by w 0 (B) = C for all B e H(X). Then w 0 is called a 
condensation transformation and C is called the associated condensation set. 

Observe that a condensation transformation w 0 : H(X) -> H(X) is a contraction 
mapping on the metric space (H(X), h(d)), with contractivity factor equal to zero, 
and that it possesses a unique fixed point, namely the condensation set. 

Definition 9.2 Let {X; w\, W2, . ■ ■ , w n } be a hyperbolic IFS with contractivity 
factor 0 < s < 1. Let w 0 : H(X) -> H(X) be a condensation transformation. Then 
{X; wo, w i, . . . , w n ) is called a hyperbolic IFS with condensation, with contractivity 
factor s . 

Theorem 7.1 can be modified to cover the case of an IFS with condensation. 

Theorem 9. 1 Let {X; w n : n = 0, 1 , 2, . . . , N } be a hyperbolic iterated function 
system with condensation, with contractivity factor s. Then the transformation W : 
H(X) -> H(X) defined by 

W(B) = U n n=0 w n (B)VBeH(X) 
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Figure 111.67. A geo- 
metric series of pine trees, 
the attractor of an IFS with 
condensation. 




is a contraction mapping on the complete metric space (7i(X), h(d)) with contrac- 
tivity factor s. That is 

h(W(B), W(C)) < s ■ h(B, CyiB, C e H(X). 

Its unique fixed point, A e H(X), obeys 

A = W(A) = U n n=0 w n (A) 

and is given by A = lim^oo W on (B)for any B e H(X). 

Examples & Exercises 

9.1. A sequence of sets [A n C X}~ 0 , where (X, d) is a metric space, is said to be 
increasing if A 0 C A x C A 2 C • • • and decreasing if A 0 D Ai d A 2 D • • The in- 
clusions are not necessarily strict. A decreasing sequence of sets [A n c H(X)}™ =0 
is a Cauchy sequence (prove it!). If X is compact then an increasing sequence of 
sets [A n c ft(X)}~ o is a Cauchy sequence (prove it!). Let {X; w 0 , w u . . . , w n ) be 
a hyperbolic IFS with condensation set C, and let X be compact. Let W 0 (B) = 
U n n=0 w n (B)VB e H(X) and let W(B) = U n n=l w n (B). Define [C n = W™(C)}™ =0 . 
Then Theorem 9.1 tells us {C n } is a Cauchy sequence in H(X) that converges to 
the attractor of the IFS. Independently of the theorem observe that 

C n = CU W(C) U W o2 (C) U . . . U W on (C) 

provides an increasing sequence of compact sets. It follows immediately that the 
limit set A obeys W 0 (A) = A. 

9.2. This example takes place in (IR 2 , Euclidean). Let C = = A 0 c 0& 2 

denote a set that looks like a scorched pine tree standing at the origin, with its trunk 
perpendicular to the jc-axis. Let 
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Figure 111.68. Sketch 
of a fractal tree, the at- 
tracted of an IPS with 
condensation. 



(x\ (0.15 0 \ (x\ (0.25\ 

Show that [M. 2 ; w 0 , w\} is an IFS with condensation and find its contractivity factor. 
Let A n = W on (A 0 ) for n = 1, 2, 3, . . ., where W(B) = U" n=Q w n (B) for B e H(R 2 ). 
Show that A n consists of the first (n + 1) pine trees reading from left to right in 
Figure III. 67. If the first tree required 0.1% of the ink in the artist's pen to draw, and 
if the artist had been very meticulous in drawing the whole attractor correctly, find 
the total amount of ink used to draw the whole attractor. 

9.3. What happens to the trees in Figure 111.67 if w\ ^* ^ is replaced by 

-GMV »y(:MV) 

in exercise 9.2? 

9.4. Find the attractor for the IFS with condensation {0& 2 ; w Q , w{\, where the 
condensation set is the interval [0, 1] and w\(x) = \x + 2. What happens if w\(x) = 

9.5. Find an IFS with condensation that generates the treelike set in Figure 111.68. 
Give conditions on r and 6 such that the tree is simply connected. Show that the tree 
is either simply connected or infinitely connected. 

9.6. Find an IFS with condensation that generates Figure 111.69. 

9.7. You are given a condensation map wq(x) in 0& 2 that provides the largest tree 
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Figure 111.69. An end- 
less spiral of little men. 




in Figure 111.46. Find a hyperbolic IFS with condensation, of the form {R 2 ; w 0 , w u 
w 2 }, which produces the whole orchard. What is the contractivity factor for this IFS? 
Find the attractor of the IFS {K 2 ; w u w 2 }. 

9.8. Explain why removing the command that clears the screen ("els") from Pro- 
gram 1 will result in the computation of an image associated with an IFS with con- 
densation. Identify the condensation set. Run your version of Program 1 with the 
"els" command removed. 



TO How to Make Fractal Models with the Help of the Collage Theorem 

The following theorem is central to the design of IFS's whose attractors are close to 
given sets. 

Theorem 10.1 (The Collage Theorem, (Barnsley 1985b)). Let (X, d) be 

a complete metric space. Let L e H(X) be given, and let€>0 be given. Choose an 
IFS (or IFS with condensation) {X; (w 0 ), w x , w 2 , . . . , w n ) with contractivity factor 
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0 < s < 1 , so that 

h(L,U n n=l w n (L))<€, 

(n=0) 

where h(d) is the Hausdorff metric. Then 

h(L, A) <€/(!- s), 
where A is the attr actor of the IFS. Equivalently, 

h(L, A) < (1 - s)~ l h(L, U" n=1 w n {L)) for all L € H(X). 

(«=0) 

The proof of the Collage Theorem is given in the next section. The theorem 
tells us that to find an IFS whose attractor is "close to" or "looks like" a given 
set, one must endeavor to find a set of transformations — contraction mappings on a 
suitable space within which the given set lies — such that the union, or collage, of the 
images of the given set under the transformations is near to the given set. Nearness 
is measured using the Hausdorff metric. 

Examples & Exercises 

10.1. This example takes place in (R, Euclidean). Observe that [0, 1] — [0, |] U 
[|, 1]. Hence the attractor is [0, 1] for any pair of contraction mappings w\ : K -> K 
and w 2 : R -> R such that wi([0, 1]) = [0, \] and w 2 ([0, 1]) = [\, 1]. For example, 
w\(x) = and w 2 (x) = ^x + ^ does the trick. The unit interval is a collage of two 
smaller "copies" of itself. 

10.2. Suppose we are using a trial-and-error procedure to adjust the coefficients 
in two affine transformations w \(x) = ax + b, w 2 (x) — cx + d, where a, b, c,d € 
R, to look for an IFS {R; w\, w 2 } whose attractor is [0, 1]. We might come up 
with wi(x) = 0.5 Ix - 0.01 and w 2 (x) = 0.47 x + 0.53. How far from [0, 1] will the 
attractor for the IFS be? To find out compute 

h ([0, 1], Uf =1 u; ( ([0, 1])) = h([0, 1], [-0.0/, 0.5] U [0.53, 1]) = 0.015 

and observe that the contractivity factor of the IFS is s = 0.51. So by the Collage 
Theorem, if A is the attractor, 

M[0, 1], A) < 0.015/0.49 < 0.04. 

10.3. Figure 111.70 shows a target set L c R 2 , a leaf, represented by the polygonal- 
ized boundary of the leaf. Four affine transformations, contractive, have been applied 
to the boundary at lower left, producing the four smaller deformed leaf boundaries. 
The Hausdorff distance between the union of the four copies and the original is ap- 
proximately 1.0 units, where the width of the whole frame is taken to be 10 units. 
The contractivity of the associated IFS {IR 2 ; w\, w 2 , w^, w 4 } is approximately 0.6. 
Hence the Hausdorff distance /i(Euclidean) between the original target leaf L and 
the attractor A of the IFS will be less than 2.5 units. (This is not promising much!) 
The actual attractor, translated to the right, is shown at lower right. Not surprisingly, 
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Figure 111.71. The re- 
gion bounded by a right- 
angle triangle is the union 
of the results of two simil- 
itudes applied to it. 

b 




a 



it does not look much like the original leaf! An improved collage is shown at the 
upper left. The distance h(L, U 4 n=l w n (L)) is now less than 0.02 units, while the con- 
tractivity of the IFS is still approximately 0.6. Hence h(L, A) should now be less 
than 0.05 units, and we expect that the attractor should look quite like L at the reso- 
lution of the figure. A, translated to the right, is shown at the upper right. 

10.4. To find an IFS whose attractor is a region bounded by a right-angle triangle, 
observe the collage in Figure 111.71. 
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Figure 111.72. Use the 

Collage Theorem to help 
you find an IFS consisting 
of two affine maps in IR 2 
whose attractor is close to 
this set. 



10.5. A nice proof of Pythagoras' Theorem is obtained from the collage in Fig- 
ure 111.71. Clearly both transformations involved are similitudes. The contractivity 
factors of these similitudes involved are (b/c) and (a/c). Hence the area A obeys 
A = (b/c) 2 A + (a/c) 2 A. This implies c 2 = a 2 + b 2 since A > 0. 

10.6. Figures 111.72— 111.76 provide exercises in the application of the Collage The- 
orem. Condensation sets are not allowed in working these examples! 

10.7. It is straightforward to see how the Collage Theorem gives us sets of maps for 
IFS's that generate A. A Menger Sponge looks like this: ^ ^ J ^ j^ ^ . Find an IFS 
for which it is the attractor. 

10.8. The IFS that generates the Black Spleenwort fern, shown in Figure 111.77, 
consists of four affine maps in the form 

(x \ ( r cos# — s sin# \ ( x \ ( h \ „ „ „ 
= -a a + / 10 = 1,2,3,4); 

y ) \ r sin # s cosQ J \y J \k ) 

see Table III.5. 

10.9. Find a collage of affine transformations in R 2 , corresponding to Figure 111.78. 

1 0. 1 0. A collage of a leaf is shown in Figure 111.79 (a). This collage implies the IFS 
{C; w\, W2, u>3, W4} where, in complex notation, 

u)i(z) = siz + (1 - Si)ai for i = 1, 2, 3, 4. 
■4 

Verify that in this formula a, is the fixed point of the transformation. The values 
found for s, and a, are listed in Table III.6. Check that these make sense in relation 
to the collage. The attractor for the IFS is shown in Figure 111.79 (b). 
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Figure 111.73. This image represents the attractor of 14 affine transformations in 
the Collage Theorem to help you find them. 



Use 



Figure 111.74. Use the 

Collage Theorem to help 
find a hyperbolic IFS of 
the form {0£ 2 ; w\, w 2 , u> 3 }, 
where w u w 2 , and io 3 are 
similitudes in IR 2 , whose 
attractor is represented 
here. You choose the 
coordinate system. 
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10. 11 . The attractor in Figure 111.80 is determined by two affine maps. Locate the 
fixed points of two such affine transformations on R 2 . 

10.12. Figure 111.81 shows the attractor for an IFS {R 2 ; w h i = 1, 2, 3, 4} where 
each w; is a three-dimensional affine transformation. See also Color Plate 3. The 



attractor is contained in the region {(x u x 2 , x 3 ) <E 
10, -10 < x 3 < 10}. 
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10.13. Find an IFS of similitudes in 1R 2 such that the attractor is represented by the 
shaded region in Figure 111.82. The collage should be "just-touching," by which we 
mean that the transforms of the region provide a tiling of the region: they should fit 
together like the pieces of a jigsaw puzzle. 

10. 14. This exercise suggests how to change the coordinates of an IFS. Let {X\,d\} 
and {X 2 , d 2 \ be metric spaces. Let {X^ W],w 2 , w N ] be a hyperbolic IFS with 
attractor A\. Let 0 : X\ X 2 be an invertible continuous transformation. Consider 
the IFS {X 2 ; 0 ow\o e~\ 9 o w 2 o6~\ ...,9ow N o 0~ { ). Use 0 to define a metric 
on X 2 such that the new IFS is indeed a hyperbolic IFS. Prove that if A 2 e H(X 2 ) is 



Table III .5. The IFS code for the Black Spleenwort, expressed in scale and angle formats. 



Translations Rotations Scalings 
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1 


0.0 


0.0 
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0 


0.0 


0.16 
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0.0 


1.6 


-2.5 


-2.5 


0.85 


0.85 
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0.0 


1.6 


49 


49 


0.3 


0.34 
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0.0 


0.44 


120 


-50 


0.3 


0.37 
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Fiaure III 75 Find an IFS of the form {R 2 ; w x , w 2 , u> 3 , ^4}, where the w.-'s are affine 
transformations on 1R 2 , whose attractor when rendered contains this image. Check your 
conclusion using Program 2. 

the attractor of the new IFS, then A 2 = 0(A l ). Thus we can readily construct an IFS 
whose attractor is a transform of the attractor of another IFS. 

10.15. Find some of the affine transformations used in the design of the fractal 
scene in Figure 111.83. 

10.16. Use the Collage Theorem to find an IFS whose attractor approximates the 
set in Figure 111.84. 
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Figure 111.76. How 

many affine transforma- 
tions in [R 2 are needed 
to generate this attractor? 
You do not need to use a 
condensation set. 



10.17. Solve the problems proposed in the captions of (a) Figure III. 85, (b) Fig- 
ure 111.86, (c) Figure 111.87. 



1 1 Blowing in the Wind: The Continuous Dependence 
of Fractals on Parameters 

The Collage Theorem provides a way of approaching the inverse problem: given a 
set L, find an IFS for which L is the attractor. The underlying mathematical principle 
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Figure 111.77. The 

Black Spleenwort fern. 
The top image illustrates 
one of the four affine 
transformations in the IFS 
whose attractor was used 
to render the fern. The 
transformation takes the 
triangle ABC to triangle 
abc. The Collage Theorem 
provides the other three 
transformations. The IFS 
coded for this image is 
given in Table III. 3. Ob- 
serve that the stem is the 
image of the whole set un- 
der one of the transforma- 
tions. Determine to which 
map number in Table III.3 
the stem corresponds. The 
bottom image shows the 
Black Spleenwort fern and 
a close-up. 





is very easy: the proof of the Collage Theorem is just the proof of the following 
lemma. 

Lemma 11.1 Let (X, d) be a complete metric space. Let f : X — > X be a con- 
traction mapping with contractivity factor 0 < s < 1 , and let the fixed point of f be 
Xf e X. Then 

d(x, x f ) < (1 - s)' 1 ■ d(x, f(x)) for all x e X. 
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Proof The distance function d(a,b), for fixed a e X, is continuous in b e X. 
Hence 

d(x,x f )=d(x, lim f on (x))= lim d(x, f on (x)) 

n 

< lim Y^/ 0 ^ 1 ^),/ 0 ^*)) 

n-+oo 

wz=l 

< lim d(x, f(x))(\ + s + ■ ■ ■ + < (1 - s)~ l d(x, f(x)). 

n-»oo 

This completes the proof. 

The following results are important and closely related to the above material. 
They establish the continuous dependence of the attractor of a hyperbolic IFS on 
parameters in the maps that constitute the IFS. 

Table 1 1 1. 6. Scaling factors and fixed points for the collage in Figure 111.79. 

s a 

0.6 0.45 + 0.9i 

0.6 0.45 + 0.3i 

0.4-0.3i 0.60 + 0.3i ■* 

0.4 + 0.3i 0.30 + 0.3i 



Figure 111.78. Use the 

Collage Theorem to find 
the four affine transfor- 
mations corresponding to 
this image. Can you find 
a transformation which 
will put in the "missing 
corner"? 



(a) Collage 




(b) Attractor 



Figure 111.79. A col- 
lage of a leaf is obtained 
using four similitudes, 
as illustrated in (a). The 
corresponding IFS is pre- 
sented in complex nota- 
tion in Table III.6. The 
attractor of the IFS is ren- 
dered in (b). 
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Lemma 1 1.2 Let (P, d p ) and (X, d) be metric spaces, the latter being com- 
plete. Let w : P x X X be a family of contraction mappings on X with contrac- 
tivity factor 0 < s < 1. That is, for each peP, w(p, •) is a contraction mapping on 
X. For each fixed xeXletwbe continuous on P. Then the fixed point ofw depends 
continuously on p. That is, x f :P-+Xis continuous. 

Proof Let x f (p) denote the fixed point of w for fixed p e P. Let p e P and 
e > 0 be given. Then for all q e P, 

d(x f (p),Xf(q)) = d(w(p,x f (p)), w(q, x f (q))) 
< d(w(p,x f (p)), w(q,x f (p))) 

+ d(w(q,Xf(p)),w(q,x f (q))) 
<d(w(p,x f (p)), w(q,x f (p))) +sd(x f (p),Xf(q)), 

which implies 

d(x f (p), x f (q)) < (1 - sy l d(w(p, x f (p)), w(q, x f (p))). 
The right-hand side here can be made arbitrarily small by restricting q to be suffi- 
ciently close to p. (Notice that if there is a real constant C such that 

d(w(p, x), w(q, x)) < Cd(p, q) for all p,qeP, for all x eX, 
then d{x f (p), x f (q)) < (1 - s)~ l • C ■ d(p, q), which is a useful estimate.) This 
completes the proof. 

Examples & Exercises 

11.1. The fixed point of the contraction mapping w : R R defined by w(x) = 
i x + p depends continuously on the real parameter p. Indeed, x f = 2p. 



Figure 111.80. Locate 
the fixed points of a pair 
of affine transformations 
in M. 2 whose attractor is 
rendered here. 
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J 



Figure 111.82. Find a 
"just-touching" collage of 
the area under this Devil's 
Staircase. 
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11.2. Show that the fixed function for the transformation w : C°[0, 1] C°[0, 1] 
defined by w( f(x)) = pf(2xmod\) +x(\-x)i* continuous in p for p e (-1, 1). 
Here, C°[0, 1] = {/ e C[0, 1] : /(0) = /(l) = 0} and the distance is d(f, g) = 
max{\f(x)-g(x)\:xe[0, 1]}. 

In order for this to be of use to us, we need some method of moving the con- 
tinuous dependence on the parameter p to H(X). We cannot do this just because the 
image of a point in some set B depends continuously on p, since, although this gives 
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Figure 111.84. "Typi- 
cal" fractals are not pretty: 
use the Collage Theorem 
to find an IFS whose at- 
tractor approximates this 
set. 



fa™ 





& 

l^n^ I^S"^ ^fy"?^ ^P"?^ 



Figure 111.85. Deter- 
mine the affine transfor- 
mations for an IFS corre- 
sponding to this fractal. 
Can you see, just by look- 
ing at the picture, if the 
linear part of any of the 
transformations has a neg- 
ative determinant? 
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Figure 111.86. Use the 

Collage Theorem to ana- 
lyze this fractal. On how 
many different scales is 
the whole image appar- 
ently repeated here? How 
many times is the smallest 
clearly discernible copy 
repeated? 





us a 8 to constrain p with in order that w(p, x) moves by less than e, this relation 
is still dependent on the point (p, x). A set B e H(X), which is interesting, contains 
an infinite number of such points, giving us no 8 greater than 0 to constrain p with to 
limit the change in the whole set. We can get such a condition by further restricting 
w(p, x). Many constraints will do this; we pick one that is simple to understand. For 
our IFS, parametrized by p e P, that is {X: w\ p , . . . , w Np }, we want the conditions 
under which given e > 0, we can find a 8 > 0 such that 



d p (p, q) <8=> h(w p (B), w q (B)) < e. 
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Figure 111.87. Consider 
the white areas in this fig- 
ure to represent a set S in 
[R 2 . Locate the boundary 
of the largest pathwise- 
connected subset of 5. It 
is recommended that you 
work with a photocopy 
of the image, a magnify- 
ing glass, and a fine red 
felt-tip pen. 
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Suppose that for every p e P, w ip {x) is a continuous function on X. Furthermore, 
we ask that there is a k > 0, independent of x and p such that for each fixed x e X 
and for each u;, , the condition 

d(w ip (x), w iq (x)) < k ■ d p (p, q) 

holds. This condition is called Lipshitz continuity. It is not the most general con- 
dition to prove what we need; we really only need some continuous function of 
d(p, q) which is independent of x on the right-hand side. We choose Lipshitz con- 
tinuity here because for the maps we are interested in, it is the easiest condition to 
check. If we can show that for any set B e TC(X) we have 

h(w ip (B),w iq (B))<k-d p (p,q), 

then we can easily get the condition we want from the Collage Theorem. Proving 
this is simply a matter of writing down the definitions for the metric h. 

h(w p (B), w q (B)) = d(w p (B), w q (B)) v d(w q (B), w p (B)), 

where 

d(w p (B),w q (B)) = max (d(x,w q (B))) 

xew p (B) 

d(x,w q (B))= min (d(x, y)). 

Now, x e w p (B) implies that there is an x e B such that x = w p (x). Then there is a 
point w q (x) e w q (B), which is the image of x under w q . For this point, our condition 
holds, and 

d(x,w q (x)) <k • d p (p,q) min (d(x, y)) < d(x, w q (x)) < k ■ d p (p, q) 

Since this condition holds, for every x e w p (B) the maximum over these points is at 
most k • d p (p, q), and we have 

d(w p (B),w q (B))<k-d p (p,q). 

The argument is nearly identical for d(w q (B), w p (B)), so we have 

h(w p (B),w q (B))<k-d p (p,q), 

and a small change in the parameter on a particular map produces a small change 
in the image of any set B e H(X). For a finite set of maps.u;^, . . . , w N , and 
their corresponding constants k\, . . . , k N , it is then certainly the case that if k = 
max i= \ N {ki), we have 

h(w ip (B),w iq (B))<k-d p (p,q). 

Now the union of such image sets cannot vary from parameter to parameter by more 
than the maximum Hausdorff distance above, consequently, 
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h(W p (B),W q (B))<kd p (p,q). 

We now apply the results of Lemma 11.2 to the complete metric space H(X), 
yielding 

h(A p , A q )<{\- s)" l h(A r , W q (A p )) < (1 - s)~ l k ■ d p (p, q). 

Theorem 11.1 Let (X, d) be a complete metric space. Let {X; w\ , . . . , w N ] be 
a hyperbolic IFS with contractivity s. For n = 1 , 2, . . . , N, let w n depend on the 
parameter p e (P,d p ) subject to the condition d(w„ p (x), w„ q (x)) < k ■ d p (p, q) for 
allx e X with k independent ofn,p,orx. Then the attr actor A(p) gH(X) depends 
continuously on the parameter p e P with respect to the Hausdorff metric h(d). 

In other words, small changes in the parameters will lead to small changes in the 
attractor, provided that the system remains hyperbolic. This is very important be- 
cause it tells us that we can continuously control the attractor of an IFS by adjusting 
parameters in the transformations, as is done in image compression applications. It 
also means we can smoothly interpolate between attractors: this is useful for image 
animation, for example. 

Examples & Exercises 

11.3. Construct a one-parameter family of IFS, of the form {R 2 ; w\, w 2 , W3}, 
where each w { is affine and the parameter p lies in the interval [0, 24]. The attrac- 
tor should tell the time, as illustrated in Figure III. 88. A(p) denotes the attractor at 
time p. 

11.4. Imagine a slightly more complicated clockface, generated by using a one- 
parameter family of IFS of the form {R 2 ; iv 0 , w\, w 2 , u; 3 }, p e [0, 24]. w 0 creates 
the clockface, w\ and w 2 are as in Exercise 1 1 .3, and is a similitude that places 
a copy of the clockface at the end of the hour hand, as illustrated in Figure III. 89. 
Then as p goes from 0 to 12 the hour hand sweeps through 360°, the hour hand on 
the smaller clockface sweeps through 720°, and the hour hand on the yet smaller 
clockface sweeps through 1080°, and so on. Thus as p advances, there exist lines 
on the attractor which are rotating at arbitrarily great speeds. Nonetheless we have 
continuous dependence of the image on p in the Hausdorff metric! At what times do 
all of the hour hands point in the same direction? 

1 1.5. Find a one-parameter family of IFS in R 2 , whose attractors include the three 
trees in Figure 111,90. 

1 1 .6. Run your version of Program 1 or Program 2, making small changes in the 
IFS code. Convince yourself that resulting rendered images "vary continuously" 
with respect to these changes. 

11.7. Solve the following problems with regard to the images (a)-(f) in Fig- 
ure 111.91. Recall that a "just-touching" collage in R 2 is one where the transforms 
of the target set do not overlap. They fit together like the pieces of a jigsaw puzzle. 
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Figure 111.88. A one- 
parameter family of IFS 
that tells the time! 




A(2 a.m.) 



172i° 




A(2.75 a.m.) 



9(T 



A(3 a.m.) 



Figure 111.89. This 
fractal clockface depends 
continuously on time in 
the Hausdorff metric. 
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Figure 111.90. Blowing 
in the wind. Find a one- 
parameter family of IFS 
whose attractors include 
the trees shown here. 
The Random Iteration 
Algorithm was used to 
compute these images. 



(a) Find a one-parameter family collage of affine transformations. 

(b) Find a "just-touching" collage of affine transformations. 

(c) Find a collage using similitudes only. What is the smallest number of affine 
transformations in U 2 , such that the boundary is the attractor? 

(d) Find a one-parameter family collage of affine transformations. 

(e) Find a "just-touching" collage, using similitudes only, parameterized by the 
real number p. 

(f) Find a collage for circles and disks. 
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Chapter IV 

Chaotic Dynamics 
on Fractals 



1 The Addresses of Points on Fractals 

We begin by considering informally the concept of the addresses of points on the 
attractor of a hyperbolic IFS. Figure IV.92 shows the attractor of the IFS: 

{£; wi(z) = (0.13 + 0.64/)z, w 2 (z) = (0.13 + 0.64/)z + 1}- 

This attractor, A, is the union of two disjoint sets, w\(A) and w 2 (A), lying to the 
left and right, respectively, of the dotted line ab. In turn, each of these two sets is 
made of two disjoint sets: 

tui(A) = wi(wi(A)) U w 2 (w l (A)), w 2 (A) = w 2 (w 2 (A)) U w 2 (w 2 (A)). 

This leads to the idea of addressing points in terms of the sequences of transfor- 
mations, applied to A, which lead to them. All points belonging to A, in the subset 
w\(wi (A)), are situated on the piece of the attractor that lies below dc and to the left 
of ab, and their addresses all begin 11.... Clearly, the more precisely we specify ge- 
ometrically where a point in A lies, the more bits to the address we can provide. For 
example, every point to the right of ab, below ef, to the left of gh, has an address 
that begins 212 .... In Theorem 2.1 we prove that, in examples such as this one, it is 
possible to assign a unique address to every point of A. In such cases we say that the 
IFS is "totally disconnected." 

Here is a different type of example. Consider the IFS 

1 11 11., 

{€; wi(z) = -z, w 2 (z) =2 Z+ 2' W = 2~ Z + 2 

The attractor, A, of tins IFS is a Sierpinski triangle with vertices at (0, 0), (1,0), 
and (0, 1). Again we can address points on A according to the sequences of trans- 
formations that lead to them. This time there are at least three points in A that 
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Figure IV.92. Ad- 
dresses of points on an 
attractor. The lines ab, cd, 
ef, and gh are not part of 
the attractor. 
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of town has an address b 
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have two addresses, because there is a point in each of the sets w\(A) n w 2 (A), 
w 2 (A) n w 3 (A), and W3(A) n tui(A) , as illustrated in Figure IV.93. 

On the other hand, some points on the Sierpinski triangle have only one address, 
such as the three vertices (0,0), (1,0), and (0, 1). Although the attractor is con- 
nected, the proportion of points with multiple addresses is "small," in a sense we 
do not yet make precise. In such cases as this we say that the IFS is "just-touching." 
Notice that this terminology refers to the IFS itself rather than to its attractor. 

Let us look at a third, fundamentally different example. Consider the hyperbolic 
IFS 

1 3 1 
{[0,1];-*,-* + -}. 

The attractor is A = [0, 1], but now 

Wl (A) n w 2 (A) = [0, i] n [i, l] = [i, X -\, 

so w\(A) n w 2 (A) is a significant piece of the attractor. The attractor would look 
very different if the overlapping piece [\, \] were missing. Now observe that every 
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31111] 
13353 




11111 



Figure IV.93. Some 
points on this Sierpin- 
ski triangle have two 
addresses, while others 
have only one address. 
Overlining on the last 
symbols, in an expres- 
sion such as 3 1 1 1 1 , means 
that the overlined symbols 
are repeated endlessly. 
For example, 31111 = 
31111111111111111 
1111 and 31123 = 
31123123123123... 



22222 



point in [\, \ ] has at least two addresses. On the other hand, the points 0 and 1 have 
only one address each. Nonetheless, it appears that the proportion of points with 
multiple addresses is large. In such cases we say that the IFS is "overlapping." 

The terminologies "totally disconnected," "just-touching," and "overlapping" re- 
fer to the IFS itself rather than to the attractor. The reason for this is that the same 
set may be the attractor of several different hyperbolic IFS's. Consider, for example, 
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Figure IV.94. Differ- 
ent IFS's with the same 
attractor provide differ- 
ent addressing schemes. 
Here the symbols {0, 1} 
are used in place of { 1 , 2} 
for obvious reasons. 



0 1 

I — (-h — i — l — I — l — I — I 

0000 0011.. oi n 01U 1111 
and and 

1000 1000 



Binary addressing of the 
interval [0,1] induced by the 
IFS {[0,l].0.5x,0.5x+0.5} 



0 1 

I — i_h — i — I — I — I — I— l 

0000 0010.. 0100 10100 1000 

and and 

1100 11000 



Alternative addresses of the 
interval [0,1] induced by the 
IFS {[0,l],0.5x,-0.5x+l} 



the two IFS's 

1 1 1, 

{[0, 1]; wi(x) = -x, w 2 (x) =2 X+ 2 } 

and 

{[0, 1]; u>i(jc) = ^x, w 2 (x) = --x + 1}. 

The attractor of each one is the real interval [0, 1]. We can obtain two different 
addressing schemes for the points in [0, 1], as illustrated in Figure IV.94. 
These two IFS are just-touching. However, the IFS 

1 3 1 

{[0, 1]; wdx) = -x, w 2 (x) = ^x + -} 

is overlapping, while its attractor is also [0, 1]. 

Examples & Exercises 

1 1 Figure IV.95 shows the attractor of an IFS of the form {R 2 ;w n ,n = \, 2, 3}, 
where each of the transformations w n : R 2 R 2 is affine. The addresses of several 
points are given. Find the addresses of a, b, and c. 
1 .2. In Figure IV.95 locate the point whose address is 1 1 1 1 1 . 
1 3 A quadtree is an addressing scheme used in computer science for addressing 
small squares in the unit square ■ = {(x lt x 2 ) e R 2 : 0 < x { < 1, 0 < x 2 < 1} as fol- 
lows. The square is broken into four quarters. Points in the first quarter have ad- 
dresses that begin 0, points in the second quarter have addresses that begin 1, and 
so on, as illustrated in Figure IV96. Find an IFS that gives rise to the addressing 
scheme suggested in Figure IV96. Is this a totally disconnected, just-touching, or 
overlapping IFS? 

1 .4. Addresses are assigned to the Sierpinski triangle, as in Figure IV93. Character- 
ize the addresses of the set_of points that lie on the outermost boundary, the triangle 
with vertices 11, 22, and 33. 
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Figure IV.95. Can you 

find the addresses of a, b, 
and c? 



1 .5. Characterize the addresses of points belonging to the boundary of the largest 
hole in Figure IV.97. 

1 .6. Consider a hyperbolic IFS with condensation set C. Suppose the condensation 
set is itself the attractor of another hyperbolic IFS. Design an addressing scheme for 
the attractor of the IFS with condensation. Can all possible addresses occur? 

1.7. Figure IV.98 shows an "overlapping" IFS attractor, for two affine transforma- 
tions in R 2 . Choose one point in each of the marked regions on the attractor. Find the 
first four numbers in two different addresses for each of these points. The first few 
numbers in the addresses of some points on the attractor are included in the figure to 
remove possible ambiguities. 

1.8. ★ Identify the set of addresses of points on the attractor, A, of a hyperbolic 
IFS with code space. Argue that nearby codes correspond to points on A which are 
nearby. 

1.9. Address the real number 0.7513 in each of the two coding schemes given in 
Figure IV.94. '* 

In thinking about the addresses of points on fractals, already we have been led to 
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Figure IV.96. Ad 

dresses at depth two in 
a quadtree. 
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try to compare "how many" points have a certain property to how many have an- 
other property. For example, in the case of the addressing scheme on the Sierpinski 
triangle described above, we wanted to compare the number of points with multiple 
addresses to the number of points with single addresses. It turns out that both num- 
bers are infinite. Yet still we want to compare their numbers. One way in which this 
may be done is through the concept of countability. 

Definition 1.1 Let S be a set. S is countable if it is empty or if there is an onto 
transformation c : I -> 5, where I is either one of the sets 

{1}, {1,2}, {1,2,3}, ...,{1, 2,3,... ,n},..., 

or the positive integers {1, 2, 3, 4, . . .}. 5 is uncountable if it is not countable. 

We think of an uncountable set as being larger than a countable set. 
We are going to make fundamental use of code space to formalize the concept of 
addresses. How many points does code space contain? 

Theorem 1 . 1 Code space on two or more symbols is uncountable. 

Proof We prove it here for the code space on the two symbols {1,2}. Denote 
an element of code space E by co = coico 2 co3 . . . , where each <u f e {1, 2}. Define 



Figure IV.97. Can you 

describe the addresses of 
the points on the bound- 
ary of the central white 
region? 



WJA) 
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W\(A) 




Figure IV.98. Attrac- 
tor of a hyperbolic IFS in 
the overlapping case. In 
the overlapping regions 
multiple addresses are 
available. 



Overlapping 
regions. 
Multiple 
addresses 
are available 



p : {1, 2} -> {1, 2} by p(l) = 2 and p(2) = 1. Suppose code space is countable. Let 
the counting function be c : {1, 2, 3, . . .} -> S. Consider the point a e E defined by 

cr = (j\(jjO'h • • • i 

where a M = p((c(n)) n ), and (c(n)) M means the nth symbol of c(n). When does the 
counting function reach a? Never! For example, c(3) / a because their third sym- 
bols are different! This completes the proof. 



Examples & Exercises 

1.10. The set of integers N = {0, ± 1 , ±2, . . . , } is countable. Define c : N -> N by 
c( z ) = (z - l)/2 if z is odd, c(z) = -z/2 if z is even. 
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1.11. Prove that a countable set of countable sets is countable. Show that an un- 
countable set, take away a countable set, is uncountable. 

1.12. The rational numbers are countable. A rational number can be written in the 
form p/q, where p and q are integers with q / 0. Figure IV.99 shows how to count 
the positive ones, some numbers being counted more than once. Make a rule that 
gets rid of the redundant countings. Also, show how to include the negative rationals 
in the scheme. 

1.13. Show that a Sierpinski triangle contains countably many triangles. 

1.14. Let S be a perfect subset of a metric space. Suppose that S contains more than 
one point. Prove that S is uncountable. 

1.15. Characterize the addresses of the missing pieces in Figure IV. 100. 



2 Continuous Transformations from Code Space to Fractals 

Definition 2.1 Let {X; w\,w 2 , ... , w N } be a hyperbolic IFS. The code space as- 
sociated with the IFS, (T,,d c ), is defined to be the code space on N symbols 
{1, 2, . . . , N}, with the metric d c given by 

dr(co,a)=y for all a>, a e D. 

CK ' } (N+l) n 



n=l 



Our goal is to construct a continuous transformation 0 from the code space as- 
sociated with an IFS onto the attractor of the IFS. This will allow us to formalize 
our notion of addresses. In order to make this construction we will need two lem- 
mas. The first lemma tells us that if we have a hyperbolic IFS acting on a complete 
metric space, but we are only interested in studying how the IFS acts in relation to a 
fixed compact subset of X, then we can treat the IFS as though it were defined on a 
compact metric space. 

Lemma 2. 1 Let {X; w„: n = 1 , 2, . . . , N} be a hyperbolic IFS, where (X, d) is a 
complete metric space. Let K e H(X). Then there exists K e H(X) such that K c K 
and w„ : K -> K for n = 1, 2, . . . , N . In other words, {K; w n : n = 1, 2, 3, . . . , N} 
is a hyperbolic IFS where the underlying space is compact. 

Proof Define W : H(X) -> H(X) by 

W(B) = U^ =l w n (B) for all 5 eH(X). 

To construct K consider the IFS with condensation {X; w n ; n — 0, 1, 2, . . . , N}, 
where the condensation map w 0 is associated with the condensation set K. By The- 
orem 7.1 in Chapter III the attractor of this IFS belongs to Ti(X). By exercise 9.1 in 
Chapter HI it can be written 

K = Closure of (K U W ol (K) U W o2 (K) U W o3 (K) U . . . U W on (K) U ). 
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It is readily seen that K C K arid that W(K) C K. This completes the proof. 

The next lemma makes the first step in linking code space to IFS attractors, by 
introducing a certain transformation <f>, which maps the Cartesian product space 
E x N x X into X. By taking appropriate limits, in Theorem 2.1 below, we will 
eliminate the dependence on N and X to provide the desired connection between E 
andX. 

Lemma 2.2 Let {X; w n : n = 1, 2, . . . , N) be a hyperbolic IFS of contractivity 
s, where (X,d) is a complete metric space. Let (£,d c ) denote the code space 
associated with the IFS. For each a e £, n e N , and x e X, define 

<p(a, n, x) = w ai o w ai o . . . o w 0n (x). 

Let K denote a compact nonempty subset ofX. Then there is a real constant D such 
that 

d(((>(a, m, xi), 0(cr, n, x 2 )) < Ds mAn 
for all a 6 E, allm, n 6 N, and all x u x 2 e K. 

Proof Let a, m, n, x u and x 2 be as stated in the lemma. Construct K from K 
as in Lemma 2.1. Without any loss of generality we can suppose that m <n. Then 
observe that 



where 



0(cr, n, x 2 ) = 0O, m, <f>(a), n - m, x 2 )), 



0) = O n - m + \(Tn-m+2 . . • <T„ . . . G E . 



Let x 3 = 4>(co, n - m, x 2 ). Then x 3 belongs to K. Hence we can write 
d(4>(a, m, 0(<r, n, x 2 )) = d(<t>(a, m, x x ), m, x 3 )) 

< sd(W ai O . . . O W am {X\), W ai O . . . O ^(^3)) 

< s 2 d(w a3 o . . . o Wa m (xi), w aj o . . . o w 0m (x 3 )) 
<s m d(x u x 3 ) <s m D, 

where D = max{^(xi, x 3 ) :x\,x 3 e K). D is finite because £ is compact. This com- 
pletes the proof. 

Theorem 2.1 Let (X, d) be a complete metric space. Let {X; w„ : n = 1, 2, . . . , 

N} fee a hyperbolic IFS. Let A denote the attractor of the IFS. Let (S, d c ) denote 
the code space associated with the IFS. For each a 6 £, n 6 N, and x e X, let 



4>(a, n, x) = w ai o w ai o . . . o w an (x). 



c(l) 
c(2) 
c(3) 
c(4) 
c(5) 
c(6) 



lit 
1/2 
2/1 
3/1 
2/2 

1/3 



c(7) . 1/4 
c(8) = 2/3 
c(9) - 3/2 
c(10) « 4/1 
c(ll) ~ S/l 
c(12) <* 4/2 
c(13) - 3/3 
0(14)^2/4 
c(15) = l/5 



Figure IV.99. How to 

count the positive rational 
numbers. What is c(24)? 



Then 



4>(o) = lim 4>((r,n, x) 
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Figure IV. 100. Char- 
acterize the addresses of 
the missing pieces. 
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jku .a >o>, 



exists, belongs to A, and is independent of x e X. If K is a compact subset ofX, 
then the convergence is uniform over x e K. The function 4> : T, — >■ A thus provided 
is continuous and onto. 

Proof Let x e X. Let K e H(X) be such that x e K. Construct K as in Lemma 
2.1. Define W : H(X) -* W(X) in the usual way. By Theorem 7.1 in Chapter III, W 
is a contraction mapping on the metric space (H(X), h(d)); and we have 

A = lim {W on (K)}. 

n-±oo 

In particular {W on (K)} is a Cauchy sequence in (H, h). Notice that (j>(a, n,x) e 
W on (K). It follows from Theorem 7.1 in Chapter II that if lim„^oo 0(a, n, x) exists, 
then it belongs to A. 

That the latter limit does exist follows from the fact that, for fixed a e E, 
{4>(a, n, x)}%L l is a Cauchy sequence: by Lemma 2.2 

d((f)(a, m, x), (f)(a,n, x)) < Ds mAn for all x e K, 

and the right-hand side here tends to zero as m and n tend to infinity. The uniformity 
of the convergence follows from the fact that the constant D is independent of 
x e K. 

Next we prove that 4> : T, — >■ A is continuous. Let e > 0 be given. Choose n so 
that s n D < e, and let a, co e £ obey 

m^2^ +1 ) m (W+l)" +1 

Then one can verify that a must agree with co through n terms; that is, o\ = co\, a 2 = 
tt>2, • . • , 0>i = &>n- It follows that for each m > n we can write 

d((f)(a, m, x), (f)(co, m, x)) = d((f>(a, n, xi), (f>(a, n, X2)), 
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for some pair xi,x 2 £ K.By Lemma 2.2 the right-hand side here is smaller than s n D 
which is smaller than e. Taking the limit as m -> oo we find 

d((t>(a),<P(cD))<:€. 

Finally, we prove that 4> is onto. Let aeA. Then, since 

A = lim W on ({x}), 

n-*oo 

it follows from Theorem 7.1 in Chapter II that there is a sequence [co {n) eX:« = 
1, 2, 3, . . .} such that 

lim n, x) = a. 

n— >-oo 

Since (£, d c ) is compact, it follows that {co (n) : n = 1, 2, 3, . . .} possesses a conver- 
gent subsequence with limit weX. Without loss of generality assume lim„_ >0O <w ( " ) 
= (o. Then the number of successive initial agreements between the components of 
(o {n) and co increases without limit. That is, if 

a(n)= number of elements in { j e iV : co^ = co k for 1 < k < j], 

where TV = {1, 2, 3, . . .}, then a(n) -» oo as n -> oo. It follows that 

d((p(co, n, x), 4>(to (n \ n, x)) < s a(n) D. 

By taking the limit on both sides as ** r oo we find d((p(o)), a) = 0, which implies 
tp((o) = a. Hence 0 : £ -> A is onto. This completes the proof. 

Definition 2.2 L<?r {X; u;„, n = 1, 2, 3, . . . , AM a hyperbolic IFS with associ- 
ated code space T, . Let <p : T, A be the continuous function from code space onto 
the am actor of the IFS constructed in Theorem 1. An address of a point a e A is any 
member of the set 

<p~\A) = {ft)£E :(f>(co) =a}. 

This set is called the set of addresses of at A. The IFS is said to be totally discon- 
nected if each point of its attractor possesses a unique address. The IFS is said to be 
just-touching if it is not totally disconnected yet its attractor contains an open set O 
such that 

(1) w t (0) n wj(O) = 0Vi, j g {1, 2, ... , N} with i ^ j; 

(2) U N \im i=l Wi(0) CO. 

An IFS whose attractor obeys (i) and (ii) is said to obey the open set condition. 
The IFS is said to be overlapping if it is neither just-touching nor disconnected. 

Theorem 2.2 Let {X; w n , n = 1, 2, . . . , TV} be a hyperbolic IFS with attractor 
A. The IFS is totally disconnected if and only if 

Wi(A)nwj(A) = 0Vi, j G {1,2, TV} with i ^ j. (1) 
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Proof If the IFS is totally disconnected, then each point on its attractor possesses 
a unique address. This implies Equation 3. If the IFS is not totally disconnected, then 
some point on its attractor possesses two different addresses. These must disagree 
at some first place: choose inverse images to get this place out front, to produce a 
contradiction to Equation 3. This completes the proof. 

Examples & Exercises 

2.1. Show that the attractor of the IFS {R; \x, \x + \} is just-touching. Classify 
the attractor for the IFS {R; \x, 1}. 

2.2. Prove that the attractor of the IFS {R; \x, \x + \} is overlapping. 

2.3. Consider the IFS {[0, 1], w n {x) = ^ + ±x, n = 1, 2, 3, . . . , 10} and for the 
associated code space use the symbols {0, 1, 2, . . . , 9}. Show that the attractor of 
the IFS is [0, 1] and that it is just-touching. Identify the addresses of points with 
multiple addresses. Show that the address of a point is just its decimal representation. 
Comment on the fact that some numbers have two decimal representations. 

2.4. Prove that the attractor to the IFS {[0, 1]; w\(x) = \x, w 2 {x) = \x + §} is 
totally disconnected. 

2.5. Prove that the IFS that generates the Black Spleenwort fern, given in Chapter 2, 
is just-touching. 

2.6. Show that the IFS {[0, 1]; w\(x) = \, w 2 {x) = \) is overlapping. 

We need to understand the structure of code space. Theorem 2. 1 told us that the 
code space on N symbols is the mother of all hyperbolic IFS consisting of N maps. 
We will use the following theorem to show that the mother is metrically equivalent 
to a classical Cantor set. 

Theorem 2.3 Let E denote the code space of the N symbols, {1,2,..., N}, and 

define two different metrics on E by 

Then CE,d\) and (E , di) are equivalent metric spaces. 

Proof We give the proof for the case N = 10. Let xjeE be given. Clearly 
we have d 2 {x,y) <d\(x,y). We must show that there is a constant C so that 
Cd\{x, y) < d 2 (x, y), where C is independent of x and y. Here we pick C = ^ 
and show that it works. 

We can suppose that for some k e {1, 2, 3, . . .}, x\ = y\, x 2 = y 2 , . . . , x^-i = 
y k -ux k ^ y k . Then 
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This completes the proof. 

We now show that code space is metrically equivalent to a totally disconnected 
Cantor subset of [0, 1]. Define a hyperbolic IFS by {[0, 1]; w n (x) = jj^x + ^ : 
n = 1,2, ...,N}. Thus 

n n + 1 



w n ([0, 1]) = [- 



■] 



forn = 1,2, N, 



■JV+1JV + 1 
as illustrated for TV = 3 in Figure IV. 101. 

The attractor for this IFS is totally disconnected, as illustrated in Figure IV. 102 
for N = 3. 

In the case TV = 3, the attractor is contained in [\ , 1]. The fixed points of the three 
transformations w { (x) = \x + \, w 2 (x) = \x + \, w 2 (x) = \x + \ are \, |, and 1, 
respectively. Moreover, the address of any point on the attractor is exactly the same 
as the string of digits that represents it in base TV + 1. What is happening here is 
this. At level zero we begin with all numbers in [0, 1] represented in base (TV + 1). 
We remove all those points whose first digit is 0. For example, in the case TV = 3 
this eliminates the interval [0, |]. At the second level we remove from the remaining 
points all those that have digit 0 in the second place. And so on. We end up with 
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Figure IV. 102. A spe- 
cial ternary Cantor set in 
the making. 




those numbers whose expansion in base (N + 1) does not contain the digit 0. Now 
consider the continuous transformation 0 : (Y,,dc) — ► (A, Euclidean). It follows 
from Theorem 2.3 that the two metric spaces are equivalent. 0 is the transformation 
that provides the equivalence. Thus, we have a realization, a way of picturing code 
space. 

Examples & Exercises 

2.7. Find the figure analogous to Figure IV. 102, corresponding to the case N = 9. 

2.8. What is the smallest number in [0, 1] whose decimal expansion contains no 
zeros? 

We continue to discuss the relationship between the attractor A of a hyperbolic 
IFS {X; w\, u>2, ■ ■ ■ , wn} and its associated code space E. Let 0 : T, — >• X be the 
code space map constructed in Theorem 2.1. Let co = CO1CO2CO3CO4 ... be an address of 
a point x e A. Then 

CO = j (0\(02(Ot,(04 . . . 

is an address of wj(x), for each j e {1, 2, ... , TV}. 

Definition 2.3 Let {X, w\, W2, ■ . . , w^} be a hyperbolic IFS with attractor A. 
A point a e A is called a periodic point of the IFS if there is a finite sequence of 
numbers {o(n) e {1,2, ... , N}}^ =1 such that 

a = w a(P) o w a(P -i) o . . . o w a(V) (a). (2) 

If a 6 A is periodic, then the smallest integer P such that the latter statement is true 
is called the period of a. 

Thus, a point on an attractor is periodic if we can apply a sequence of iu M 's to it, 
in such a way as to get back to exactly the same point after finitely many steps. Let 
a e A be a periodic point that obeys (2). Let a be the point in the associated code 
space, defined by 

o=o(P)o(P - \)...o(\)o(P)o(P - 1) ...o(\)o(P)o{P - 1)... 
= o(P)o{P -\)...o(\). (3) 
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Then, by considering lim„^oo n, a), we see that 0 (a ) = a. 

Definition 2.4 A point in code space whose symbols are periodic, as in (3), is 
called a periodic address. A point in code space whose symbols are periodic after a 
finite initial set is omitted is called eventually periodic. 

Examples & Exercises 

2.9. An example of a periodic address is 
1212121212121212121212121212121212121212121212121212121212 

where 12 is repeated endlessly. An example of an eventually periodic address is: 
1121111112111121111211112122121121212121212121212121212121..., 

where 21 is repeated endlessly. 

2.10. Prove the following theorem: "Let {X; w u w 2 , . ■ ■ , w N } be a hyperbolic IFS 
with attractor A. Then the following statements are equivalent: 

(1) x e A is a periodic point; 

(2) x e A possesses a periodic address; 

(3) x e A is a fixed point of an element of the semigroup of transformations 
generated by {w\, w 2 , ■ . • , wn)" 

2.1 1 . Show that a point x e [0, 1] is a periodic point of the IFS 

1 1 !, 
<[0.i]; 5 *,-* + 5 } 

if and only if it can be written x = p/(2 N - 1) for some integer 0 < p < 2 N - 1 and 
some integer JVe{l,2, 3,...}. 

2.12. Let {X; w u w 2 , . . ■ , w N } denote a hyperbolic IFS with attractor A. Define 
W(S) = U% =l w n (S) when 5 is a subset of X. Let P denote the set of periodic points 
of the IFS. Show that W(P) = P. 

2.13. Locate all the periodic points of period 3 for the IFS {R 2 ; \z, \z + \, \z + 
|}. Mark the positions of these points on A. 

2.14. Locate all periodic points of the IFS {R; w^x) = 0, w 2 (x) = \x + \}. 
Theorem 2.4 The attractor of an IFS is the closure of its periodic points. 
Proof Code space is the closure of the set of periodic codes. Lift this statement to 

A using the code space map <p : £ -> A. (0 is a continuous mapping from a metric 
space E onto a metric space A. If S C £ is such that its closure equals S, then the 
closure of f(S) equals A.) 

Examples & Exercises 

2.15. Prove that the attractor of a totally disconnected hyperbolic IFS of two or 
more maps is uncountable. 
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2. 1 6. Under what conditions does the attractor of a hyperbolic IFS contain uncount- 
ably many points with multiple addresses? Do not try to give a complete answer; just 
some conditions: think about the problem. 

2. 1 7. Under what conditions do there exist points in the attractor of a hyperbolic 
IFS with uncountably many addresses? As in 2.16, do not try to give a full answer. 

2.18. In the standard construction of the classical Cantor set C, described in exer- 
cise 1.5 in Chapter III, a succession of open subintervals of [0, 1] is removed. The 
endpoints of each of these intervals belong to C. Show that the set of such interval 
endpoints is countable. Show that C itself is uncountable. C is the attractor of the IFS 
{[0, 1]; jx, !•* + §}. Characterize the addresses of the set of interval endpoints in C. 



3 Introduction to Dynamical Systems 

We introduce the idea of a dynamical system and some of the associated terminol- 
ogy. 

Definition 3. 1 A dynamical system is a transformation f : X ^ X on a metric 
space (X, d). It is denoted by {X; /}. The orbit of a point x e X is the sequence 

{/ 0 "(*)C=o- 

As we will discover, dynamical systems are sources of deterministic fractals. The 
reasons for this are deeply intertwined with IFS theory, as we will see. Later we 
will introduce a special type of dynamical system, called a shift dynamical system, 
which can be associated with an IFS. By studying the orbits of these systems we 
will learn more about fractals. One of our goals is to learn why the Random Iteration 
Algorithm, used in Program 2 in Chapter III, successfully calculates the images of 
attractors of IFS. More information about the deep structure of attractors of IFS will 
be discovered. 

Examples & Exercises 

3. 1 . Define a function on code space, / : £ — ► S, by 

fix 1X2X3X4 . . .) = X2X3X4X5 .... 

Then {£; /} is a dynamical system. 

3.2. {[0, 1]; fix) = kxil — x)} is a dynamical system for each X e [0, 4]. We say 
that we have a one-parameter family of dynamical systems. 

3.3. Let wix) = Ax + t be an affine transformation in R 2 . Then {R 2 ; w] is a dy- 
namical system. 

3.4. Define T : C[0, 1] -* C[0, 1] by 

11 111 
(r/)(x) = -/(-*) + -/(-* + -). 
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Figure IV. 103. An 

example of a "stretch, 
squeeze, and bend" dy- 
namical system (Smale 
horseshoe function). 



Finish. Put the deformed 
space back on itself. 



Then {C[0, 1]; T) is a dynamical system. 

3.5. Let w: C -> C be a Mobius transformation. That is w(z) = (az + b)/ (cz + d), 
where a, b, c, deC, and (ad - be) ^ 0. Then {C; w(z)} is a dynamical system. 

3.6. {[0, 1]; 2xmod\) is a dynamical system. Here Ixmodl = 2x - [2x], where 
[2x] denotes the greatest integer less than or equal to 2x. 

3.7. Define a transformation /:■->■ as illustrated in Figure IV. 103. {■; /} is a 
dynamical system. 

In dynamical systems theory one is interested in what happens when one follows a 
typical orbit: is there some kind of attractor that usually occurs? Dynamical systems 
become interesting when the transformations involved are not contraction mappings, 
so that a single transformation suffices to produce interesting behavior. The orbit of 
a single point may be a geometrically complex set. Some thought about horizontal 
slices through Figure IV. 104 will quickly suggest to the inquisitive student that there 
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Figure IV. 104. One 

million iterations of a 
small black square in a 
"stretch, squeeze, and 
bend" dynamical system. 
Can you find a relation- 
ship to IFS theory? 



is a close relationship between this noncontractive dynamical system and a hyper- 
bolic IFS. 

Definition 3.2 Let {X; f)bea dynamical system. A periodic point of f is a point 
x 6 X such that f on (x) = x for some n 6 {1 , 2, 3, . . .}. If x is a periodic point of f, 
then an integer n such that f on (x)=x,n €{1,2,3, ...} is called a period of x. The 
least such integer is called the minimal period of the periodic point x. The orbit of 
a periodic point of f is called a cycle of f. The minimal period of a cycle is the 
number of distinct points it contains. A period of a cycle of f is a period of a point 
in the cycle. 

Definition 3.3 Let {X; /} be a dynamical system and let x/ eXbe a fixed point 
off. The point x / is called an attractive fixed point of f if there is a number e >0so 
that f maps the ball B(x f ,e) into itself and moreover fis a contraction mapping on 
B(x f , e). Here B(x f , e) = {y 6 X : d(x f , y) < e}. The point x f is called a repulsive 
fixed point of f if there are numbers e > 0 and C > 1 such that 

d(f(x f ), f(y)) > Cd(x f , y) for all y 6 B(x f , e). 

A periodic point of / of period n is attractive if it is an attractive fixed point of 
f on . A cycle of period n is an attractive cycle of / if the cycle contains an attractive 
periodic point of / of period n. A periodic point of / of period n is repulsive if it 
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REPULSIVE FIXED POINT HflU» IV. 105. The 

dynamics of a simple 
Mobius transformation. 
Points spiral away from 
one fixed point and spiral 
in toward the other. What 
happens if the fixed points 
coincide? 



ATTRACTIVE FIXED POINT 



IN THE 
PLANE 




is a repulsive fixed point of f° n . A cycle of period n is a repulsive cycle of / if the 
cycle contains a repulsive periodic point of / of period n. 

Definition 3.4 Let {X, /} be a dynamical system. A point x e X is called an 
eventually periodic point of f if f om (x) is periodic for some positve integer m. 

Remark: The definitions given here for attractive and repulsive points are con- 
sistent with the definitions we use for metric equivalence and will be used through- 
out the text. The definitions used in dynamical systems theory are usually more topo- 
logical in nature. These are given later in exercises 5.4 and 5.5. 

Examples & Exercises 

3.8. The point x/ = 0 is an attractive fixed point for the dynamical system {R; ^x}, 
and a repulsive fixed point for the dynamical system {R; 2x}. 

3.9. The point z = 0 is ah attractive fixed point, and z = oo is a repulsive fixed 
point, for the dynamical system 

{C; (cos 10° + / sin 10°)(0.9)z}. 



SPHERE 
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Figure IV. 106. Points 
belonging to an orbit of a 
Mobius transformation on 
a sphere. 




Figure IV. 107. This 
shows an example of 
a web diagram. A web 
diagram is a means for 
displaying and analyzing 
the orbit of a point 
for a dynamical system 
(R, /). The geometrical 
construction of a web 
diagram makes use of the 
graph of f(x). 




A typical orbit, starting from near the point of infinity on the sphere, is shown in 
Figures IV. 105 and IV. 106. 

3.10. The point x f = lllTTT is a repulsive fixed point for the dynamical system 
{E; /} where / : E ->• E is defined by 

/(* 1*2*3*4*5 • • •) = *2*3*4*5 
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Show that jc = 121212 is a repulsive fixed point of period 2, and that {1212, 2121} is 
a repulsive cycle of period 2. 

3.11. The dynamical system {[0, 1]; |jc(1 - jc)} possesses the attractive fixed point 
Xf = 0. Can you find a repulsive fixed point for this system? 

There is a delightful construction for representing orbits of a dynamical system 
of the special form {R; /(jc)}. It utilizes the graph of the function f : R -* R. We 
describe here how it is used to represent the orbit {x n = f on (x 0 )}™ =l of a point 
xq e R. 

For simplicity we suppose that / : [0, 1] -> [0, 1]. Draw the square {(jc, y) : 0 < 
x < 1, 0 < y < 1} and sketch the graphs of y = /(jc) and y = x for jc e [0, 1]. 
Start at the point (jc 0 , jc 0 ) and connect it by a straight-line segment to the point 
Oto, jci = /(Jc 0 )). Connect this point by a straight-line segment to the point (jcj, jcj). 
Connect this point by a straight-line segment to the point (jcj , jc 2 = /(jcj)); and con- 
tinue. The orbit itself shows up on the 45° line y = jc, as the sequence of points 
(jco.jco), C*i, C*2, x 2 ), .. We call the result of this geometrical construction a 
web diagram. 

It is straightforward to write computergraphical routines that plot web diagrams 
on the graphics display device of a microcomputer. The following program is written 
in BASIC. It runs without modification on an IBM PC with Color Graphics Adaptor 
and Turbobasic. On any line the words preceded by a ' are comments: they are not 
part of the program. 



Program 1 . 

1=3.79 : xn=0.95 

def fnf (xn)=l*xn*(l-xn) 

screen 1 : els 
window (0,0)-(l,l) 

for k=l to 400 
pset(k/400, fnf(k/400)) 
next k 
do 

n=n+l 
y=f nf (xn) 

line (xn,xn)-(xn,y) , n 
line (xn,y)-(y,y) , n 
xn=y .-1 



'parameter value 3.79, orbit starts 
at 0.95 

'change this function f (x) for other 

dynamical systems 
'initialize computer graphics 
'set plotting window to 0 < x < 1 , 

0 < y < 1 

'plot the graph of the f(x) 



'the main computational loop 
'increment the courter, $n$ 
'compute the next point on the orbit 
'draw a line from (xn,xn) to (xn,y) 
in color n 

'draw a line segment from (xn,y) to 

(y,y) in color n 
'set xn to be the most recently computed 

point on the orbit 



loop until instat : end 'stop running if a key is pressed. 
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Two examples of some web diagrams computed using this program are shown in 
Figure IV.108. The dynamical system used in this case is {[0, 1]; f(x) = 3.79jc(1 — 



Examples & Exercises 

3. 1 2. Rewrite Program 1 in a form suitable for your own computer environment. 
Use the resulting system to study the dynamical systems {[0, 1]; Ajc(1 — jc)} for A = 
0.55, 1.3, 2.225, 3.014, 3.794. Try to classify the various species of web diagrams 
that occur for this one-parameter family of dynamical systems. 

3.13. Divide [0, 1] into 16 subintervals [0, -L), [^, ^), . . . , ]§), [jf , Let 
/ : [0, 1] — ► [0, 1] be defined by f(x) = Xx(l—x), where A e [0, 4] is a parame- 
ter. Compute {f on {\) : n = 0, 1,2,..., 5000} and keep track of the frequency with 
which f on {\) falls in the fcth interval for k = 1, 2, 4, 8, 16, and A = 0.55, 1.3, 2.225, 

3.014. 3.794. Make histograms of your results. 

3.14. Describe the behavior for the one-parameter family of dynamical system 
s{E. U {oo}; Ajc}, where A is a real parameter, in the cases (i) A = 0; (ii) 0 < |A| < 1; 
(iii) A = -1; (iv) A = 1; (v) 1 < A < oo. 

3.15. Analyze possible behaviors of {IR 2 ; Ax + t}, where Ax + t is an affine trans- 
formation. 

3.16. Study possible behaviors of orbits for the dynamical system {<C;M6bius 
transformation}. You should make appropriate changes of coordinates to simplify 
the discussion. 

3.17. Show that all points are eventually periodic for the slide-and-fold dynamical 
system {IR; /}, where 



This system is illustrated in Figure IV. 109. 

3.18. Let {X; w\, w 2 , . . . , w N ] be a hyperbolic IFS. Then {H(X); W} is a dynami- 
cal system, where 



Dynamical systems that act on sets in place of points are sometimes called set 
dynamical systems. Show that the attractor of the IFS is an attractive fixed point of 
the dynamical system fH(X); W}. You should quote appropriate results from earlier 
theorems. 

3.19. We consider again our two-dimensional code space, having both past and 
future, called the space of shifts (see exercise 1.12 in Chapter II). In this space, the 
operation of the shift transformation is a homeomorphism of the space to itself (it is 
frequently called the shift automorphism). There is a very geometrical interpretation 



x)}. 



fix) = 



jc + l ifjc<0, 
-jc + 1 if x > 0. 



W(B) = U„ =l w n (B) for all B e H(X). 



3 Introduction to Dynamical Systems 137 




ORDER 



Figure IV. 108. Two 

examples of web di- 
agrams computed us- 
ing Program 1. The dy- 
namical system in this 
case is {[0, 1]; /(*) = 
Xx(\ - jc)}, for two dif- 
ferent values of k e (0, 4). 
The system corresponding 
to the lower value of X is 
orderly; the other is close 
to being chaotic. 
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Figure IV. 109. An or- 
bit of the "slide-and-fold" 
dynamical system de- 
scribed in example 3.17. 
Can you prove that all 
orbits are eventually peri- 
odic? 




of the shift automorphism here. We arrive at this by looking at the action of the shift 
transformation with the metric dk with k = N, as in exercise 2.19, Chapter II. To 
simplify the discussion, assume N = 2. The space of shifts is a two dimensional 
code space with points 

(x,y) = (xix 2 x 3 ...,yiy 2 y3...) 
on which we put the "Euclidean" metric (see exercise 2.6 in Chapter II), 



d((x u yO, (x 2 , y 2 )) = 

N 



xit ~x 2i \ y h - y 2i 



E^ + E 



J = l / \i = l / 

The shift transformation here is best described by writing 

(x,y) = ...y 3 y 2 yi.xix 2 X3.... 

We now shift by moving the dot one place to the right, to get 

T(x, y) = ... y 2 y\x x .x 2 x 3 x A ... = (x 2 x 3 x x yxy 2 . . .). 

With the metric just mentioned, we can relate it to the square [0, 1] x [0, 1]. Each 
point in this square has a binary expansion in terms of ones and zeros, so that a 
point (x, y) can be written (.x\x 2 . . . , .y\y 2 . . .), with precisely the same symbols 
and metric (Euclidean). The shift operation can now be seen as doing the following: 

stretch x: double x. This shifts the first digit up so that it is in the ones place. 
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Figure IV. 110. A sign 
of things to come. 



squeeze y: halve y. This shifts a zero into the first digit and shifts all other digits 
down. 

raise half the interval: If the digit now in the ones place for x is a 1, replace the 

new 0 first digit of y with a 1 . This adds a half to y . 
put it on top: If the digit in the ones place for jc is a 1, discard it. This brings the 

jc values to those between 0 and 1 , so this half of the points is put above the 

other half. 

What we have done is stretch the square to twice its width (double x) and half its 
height (halve y), cut the rectangle into two pieces at x = 1 , and put the right half on 
top of the bottom half (add 1/2 to y if the new jc is greater than 1). This operation 
of stretching out the square, cutting it, and stacking the pieces is called a baker's 
transformation, because it resembles a baker rolling, cutting, and stacking dough (to 
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make pastry, for example). It is identical to the shift transformation on_the space of 
shifts so long as the dough remains in distinct layers (unlike !R 2 , 01 # 10). 

Definition 3.5 This transformation is famous because it is the heart of any 
invertible "mixing" function (one has to allow any number of cuts and for uneven 
rolling) A mixing function is a function f such that given any set A (from some 
class of sets; here let's say sets with an interior), and any other set B from the same 
class, there is an N such that f n (A) n B # &for any n>N. 

The term mixing is appropriate: if A is red and B is blue, then eventually they 
are both somewhat purple (have both red and blue in them). A nice property of this 
mixing business is that there is at least one point in the space such that {/"(*) : n - 
12 } is dense, that is, given an open set O, there is an n such that /"(*) e O. 
When / has this property, we say that it has a dense orbit. T is mixing on the space 
of shifts and on code space, and it has a dense orbit as a result. 

Examples & Exercises 

3.20. Prove that for any code space E on N symbols, there is a point a e E, 
such "that a has a dense orbit under the shift transformation, that is {T n {o) : n = 
1, 2, 3, . . .} is dense in E. 

3.21 . Show that T is mixing on code space for the class of open sets. 



Dynamics on Fractals: Or How to Compute Orbits by Looking at Pictures 

We continue with the main theme for this chapter, namely dynamical systems on 
fractals. We will need the following result. 

Lemma 4.1 Let {X; w n , n = 1, 2, . . . , N] be a hyperbolic IFS with attractor A. 
If the IFS is totally disconnected, then for each n e {h 2, . . . , N),the transformation 
w n - A -> A is one-to-one. 

Proof We use a code space argument. Suppose that there is an integer n e 
{1 2, . N} and distinct points a u a 2 e A so that w„(fli) = w n (a 2 ) = a e A. If a x 
has address co and a 2 has address a, then a has the two addresses nco and no. This is 
impossible because A is totally disconnected. This completes the proof. 

Lemma 4. 1 shows that the following definition is good. 

Definition 4.1 Let {X; w n , n = 1, 2, . . . , N} be a totally disconnected hyper- 
bolic IFS with attractor A. The associated shift transformation on A is the trans- 
formation S : A A defined by 

S(a) = w~ l (a) for a ew n (A), 
where w n is viewed as a transformation on A. The dynamical system {A; S} is called 
the shift dynamical system associated with the IFS. 
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Examples & Exercises 

4. 1 . Figure IV. 1 1 1 shows the attractor of the IFS 

l R '°- 47 (:;)'°-(:;) + (0' o - 47 (:;) + (o)l- 

Figure IV. Ill also shows an eventually periodic orbit {a n = 5 , °"(ao)}^ = o f° r me 
associated shift dynamical system. This orbit actually ends up at the fixed point 
0(2222). The orbit reads a 0 = 0(13132222), a x = 0(31312222), a 2 = 0(132222), 
_3 = 0(32222), 04 = 0(2222), where 0 : £ — >■ A is the associated code space map. 
«4 e A is clearly a repulsive fixed point of the dynamical system. Notice how one 
can read off the orbit of the point do from its address. Start from another point very 
close to flo an d see what happens. Notice how the dynamics depend not only on A 
itself, but also on the IFS. A different IFS with the same attractor will in general lead 
to different shift dynamics. 

4.2. Both Figures IV.112 and IV.113 show attractors of IFS's. In each case the 
implied IFS is the obvious one. Give the addresses of the points {a n — S on (ao)}^ 0 
of the eventually periodic orbit in Figure IV.112. Show that the cycle to which the 
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Figure IV. 111. An 

orbit of a shift dynamical 
system on a fractal. 
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orbit converges is a repulsive cycle of period 3. The orbit in Figure IV.113 is either 
very long or infinitely long: why is it hard for us to know which? 

4.3. Figure IV. 114 shows an orbit of a point under the shift dynamical system 
associated with a certain IFS {R 2 \ w u w 2 , w 3 }, where w u w 2 , and w 2 are affine 
transformations. Deduce the orbits of the points marked b and c in the figure. 

4.4. Figure IV. 115 shows the start of an orbit of a point under the shift dy- 
namical system associated with a certain hyperbolic IFS. The IFS is of the form 
{R; wu w 2 , m}, where the transformations w n : R -+ R are affine and the attractor 
is [0, 1]. Sketch part of the orbit of the point labelled b in the figure. (Notice that this 
IFS is actually just-touching: nonetheless it is straightforward to define uniquely the 
associated shift dynamics on Of) A where O is the open set referred to in Definition 
2.2.) 

We can sharpen up the definition of the overlapping IFS with the aid of the mixing 

properties discussed in section 3. Let {X; Wi w N ) be a hyperbolic IFS, and 

define the set 

M = \J(Wi(A)Dwj(A)) 
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Figure IV. 11 3. A 

chaotic orbit getting 
started. The shift dy- 
namics are often wild. 
Why? 



of points in various intersections of the maps of the IFS. Then the following proper- 
ties hold: 

open interior: If there is a set O, open with respect to A, such that O C M, 
then the IFS is overlapping. This allows the IFS to be declared overlapping 
easily in some cases. The proof is not too difficult: Suppose this to be the 
case, namely that M contains an open set O. Suppose that 0\ were an open 
set that we thought might satisfy the open set condition for just-touching IFS. 
Then W n {0\) D O = 0 for all n, since 0\ can't contain points in the overlap, 
and maps inside itself. Using the continuous map 0 : £ A, we know that 
we would then have <t>~ x {0{) and </>~ l (0) both open sets in code space. But 
T n {(f>~ l {0)) must intersect <t>~ l {0{) in code space for some n, due to mixing, 
and an address in the intersection corresponds to a point a on the attractor 
such that W n ({a) thus intersects O. Hence O x cannot exist, and the IFS is 
overlapping. 

dense address: Notice that in order to prevent the IFS from being just-touching 
in the proof just given, the orbit of (f>~ l (M) only needs to be dense in E 
to end up with points in the image of any open set in A. Consequently, 
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Figure IV. 114. The 

orbit of the point a is 
shown. Can you plot 
the first few points of 
the orbits of b and c? 
Warning! The IFS here is 
not the usual one. See how 
the knowledge of some 
dynamics can imply some 
more! 




Figure IV. 115. This 
figure shows a sketch of 
part of an orbit of an 
IFS {[0, 1]; u>i, u> 2 , m) 
on its attractor [0,1]. 
The transformation w\ : 
[0, l]-> [0, 1] is affinefor 
i = 1, 2, 3. Sketch part of 
the orbit of b. 




an IFS is overlapping if the orbit of 0 -1 (M) in code space under the shift 
transformation is dense, 
empty interiors: If the IFS is made up of affine maps, and it "looks" just- 
touching, that is, M does not have an interior, then it is just-touching. The 
important property of affine maps used here is that they map boundary points 
to boundary points, and boundary points come from boundary points. 
4.5. The empty interiors property is not general, but is useful when it applies. To 
see why it is restricted, consider the following IFS, made of six translated copies of 
the following map: 

On the interval [-1 , 1], let 9{x) = Arccos(x), that is, map x to the point directly 
above it on the unit circle. Then take 0(x) to the point 6{x) - a sin6>(x), where 
a e [0, 1 /2). Then map the new point on the circle back to [-1, 1] by taking 0(x) 
to x' = cos 0(jc). Now fold the interval over at 0 with the map x' 2 , and ensure that it 
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is contractive by dividing by 3. The interval has now been mapped to [0, 1/3]. Call 
this map v(x). Explicitly, we have 

1 2 

v(x) = - cos ( Arccos (x ) — sin(Arccos(jc)). 
We now form 6 maps w\, . . . , by translations and inversions of v(x): 

W\ (X) — v(x) — \ U>2(x) 
1 

w 3 (x) = v(x) - - w 4 (x) 

1 

w 5 (x) = v(x) + - w 6 (x) 

The reader should be able to verify that the attractor of the IFS 

{[-1, 1]; Wi, W 2 , U>3, w 4 , w 5 , w 6 } 

is the interval [—1, 1], and that each of these maps touches any neighbor at a single 
point. There is a point xo(a) such that 

Arccos(jco) — or sin Arccos(xo) = 7r/2, 

whose image is the points 

{-1, -1/3, 1/3, 1}, 

and whose endpoints map to {—2/3,0, 2/3}. By choosing a at different values we 
can move x 0 around the interval [—1/3, 0]. If we pick a e £ to be a dense orbit in 
E under the shift transformation, we can successively approximate this address for 
x 0 such that the address of x 0 is 3a, and we can do this for each such a e S. We can 
also do this for a variety of periodic orbits, which are not dense. 

It turns out that for most values of jc 0 e [— 1/3, 0] (the probability of a value in this 
interval being one of these is 100%), this IFS is overlapping, although between every 
two values for which it is overlapping, there is a value for which it is just-touching. 
The attractors of this family of IFS are identical, as are the intersection points. 
This is thus both an example of an IFS that has a finite set of intersection points 
and is (sometimes) overlapping, and an example of one that does not go smoothly 
through the succession from totally disconnected to just-touching to overlapping. 
Small wonder these properties are defined for the IFS and not the attractor; they are 
really properties governing the behavior of addresses in code space. 



— —v(x) 

v ' 3 

= -v(x) + i 
= -v(x) + 1. 
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■4 

Definition 5. 1 Two metric spaces (Xi , d\) and (X 2 , d 2 ) are said to be topologically 
equivalent if there is a homeomorphism f: Xi — > X 2 . Two subsets Si C Xi and S2 C 
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Figure IV. 11 6. A com- 
mutative diagram that es- 
tablishes the equivalence 
between the two dynami- 
cal systems {Xi; f\] and 
{X 2 ; f 2 }- The function 
h : Xi — >• X 2 is a home- 
omorphism. 



X 2 are topologically equivalent, or homeomorphic, if the metric spaces (Si, d\) and 
(S 2 , d 2 ) are topologically equivalent. S\ and S 2 are metrically equivalent if (Si, d\) 
and (S 2 , d 2 ) are equivalent metric spaces. 

The Cantor set and code space, discussed following Theorem 2.3 in Chapter IV, 
are metrically equivalent. Theorem 8.5 in Chapter II tells us that if / : Xj — ► X 2 is 
a continuous one-to-one mapping from a compact metric (Xi, d\) onto a compact 
metric space (X 2 , d 2 ) then / is a homeomorphism. So by means of the code space 
mapping <p : £ — ► A (Theorem 2.1) one readily establishes that the attractor of a 
totally disconnected hyperbolic IFS is topologically equivalent to a classical Cantor 
set. 

Topological equivalence permits a great deal more "stretching" and "compres- 
sion" to take place than is permitted by metric equivalence. Later we will define a 
quantity called the fractal dimension. The fractal dimension of a subset of a metric 
space such as (R 2 , Euclidean) provides a measure of the geometrical complexity of 
the set; it measures the wildness of the set, and it may be used to predict your ex- 
citement and wonder when you look at a picture of the set. We will show that two 
metrically equivalent sets have the same fractal dimension. If they are merely topo- 
logically equivalent, their fractal dimensions may be different. 

With the naturally implied metrics, [0, 1] is homeomorphic to [0, 2]. ■ is homeo- 



What is more, 



is even homeomorphic to 



, and 



is 



morphic to 

homeomorphic to '/w' /A Xa . 

In fractal geometry we are especially interested in the geometry of sets, and in the 
way they look, when they are represented by pictures. Thus we use the restrictive 
condition of metric equivalence to start to define mathematically what we mean 
when we say that two sets are alike. However, in dynamical systems theory we are 
interested in motion itself, in the dynamics, in the way points move, in the existence 
of periodic orbits, in the asymptotic behavior of orbits, and so on. These structures 
are not damaged by homeomorphisms, as we will see, and hence we say that two 
dynamical systems are alike if they are related via a homeomorphism. 

Definition 5.2 Two dynamical systems {X\; f\] and {X 2 ; f 2 ) are said to be 
equivalent, or topologically conjugate, if there is a homeomorphism 6 : X) — ► X2 
such that 

f ] (x i )=6- 1 o f 2 o0(x { ) for alljd eXi, 

/2U2) = 0 o /, o 6» _1 (jc 2 ) for all x 2 e X 2 . 

In other words, the two dynamical systems are related by the commutative diagram 
shown in Figure IV.116. 

The following theorem expresses formally what should already be clear intu- 
itively from our experience with shift dynamics on fractals. 
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Theorem 5. 1 Let {X; vox , u>2, . . . , u>n} be a totally disconnected hyperbolic IFS 
and let {A; S} be the associated shift dynamical system. Let E be the associated code 
space ofN symbols and let T : E — >• E be defined by 

T(ai(T2&3 ■ ■ ■) — 020304 • • • for all a = (710203 • • • € E. 

Then the two dynamical systems {A; S} and {E; T} are equivalent. The homeomor- 
phism that provides this equivalence is <f) : E — >• A, as defined in Theorem 4.2.1. 
Moreover, {ax, a2, . . . , a p } is a repulsive cycle of period p for S if, and only if, 
(0(fli), 4>(a2), . . . , 4>(a p )} is a repulsive cycle of period pfor T. 

Examples & Exercises 

5.1. Let {Xi; fx} and {X2; fi) be equivalent dynamical systems. Let a homeomor- 
phism that provides this equivalence be denoted by 0 : Xi — >• X2. Show that 

{X\, X2, . ■ ■ , x p ) 

is a cycle of period p for {Xi ; f\ } if and only if 

{0(x l ),e(x 2 ),...,0(x p )} 

is a cycle of period p for {X2; /2}. Suppose that {x\ , x 2 , . . . , x p } is an attractive cycle 
for /]. Show that this does not imply that {0(jti), . . . , 0(x p )} is an attractive cycle for 

h- 

5.2. Let {Xi; f\] and {X2; /z} be equivalent dynamical systems. Let a homeomor- 
phism that provides this equivalence be denoted by 0 : Xi — >• X2. Let {/!°"(^)}^o ^ e 
an eventually periodic orbit of f\. Show that {f 2 on (0( x ))}^Lo is an eventually peri- 
odic orbit of f2- 

5.3. Let {Xi; f\) and {X2; fi) be equivalent dynamical systems. Let a homeomor- 
phism that provides this equivalence be denoted by 0 : Xi — >• X2. Let this homeo- 
morphism be such as to make the two spaces (Xi, d\) and (X2, di) metrically equiva- 
lent. Construct an example where Xf € Xi is a repulsive fixed point of the dynamical 
system {Xi, f\) ye\0{xf) is not a repulsive fixed point of {X2, ^2}- 

5.4. Let {Xi; f\) and {X2; f2) be equivalent metric spaces. Let a homeomorphism 
that provides their equivalence be denoted by 0 : X\ — >• X2. Let Xf € Xi be a fixed 
point of f\ . Suppose there is an open set O that contains Xf and is such that x € O 
implies lim^^oo f x on {x) — Xf. Show that there is an open neighborhood of 9{xf) in 
X2 with a similar property. 

5.5. Our definition of attractive and repulsive fixed points and cycles, Definition 
3.4, has the feature that it depends heavily on the metric. It is motivated by the 
situation of analytic dynamics where small disks are almost mapped into disks. 
Show how one can use exercise 5.4 to make a definition of an attractive cycle in 
such a way that attractiveness of cycles is preserved under topological conjugacy. 
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Figure IV. 11 7. Attrac- 
tive and repulsive fixed 
points in a web diagram 
for a differentiable dy- 
namical system. Analyze 
the ? points. 



ATTRACTIVE 
FIXED POINT 
|F(X)| < 1 




5.6. Let AcK. Then a function / : A -» A is differentiable at a point x 0 e A if 

fix) - f(xo) 



lim 

X^XQ 



X — Xq 



exists. If this limit exists it is denoted by f'(x 0 ). Let {R; w\, w 2 , . . . , w N ] be a to- 
tally disconnected hyperbolic IFS acting on the metric space (R, Euclidean). Sup- 
pose that, for each n = 1, 2, . . . , N, w n (x) is differentiable, with |u^(x)| > 0 for all 
x e R. Show that the associated shift dynamical system {A; 5} is such that 5 is dif- 
ferentiable at each point Jto e A and, moreover, |S'(xo)l > 1 for all * e A. 

5.7. Let {R; /} and {R; g} be equivalent dynamical systems. Let a homeomorphism 
that provides their equivalence be denoted by 6 : R -» R. If 0(x) is differentiable for 
all x e R, then the dynamical systems are said to be diffeomorphic. Prove that a\ is 
an attractive fixed point of / if and only if 6 (a i) is an attractive fixed point of g. 

5.8. Let {R; /} be a dynamical system such that / is differentiable for all x eR. 
Consider the web diagrams associated with this system. Show that the fixed points 
of / are exactly the intersections of the line y = x with the graph y = fix). Let a 
be a fixed point of /. Show that a is an attractive fixed point of / if and only if 
| fid) | < 1. Generalize this result to cycles. Note that if {a\, a 2 , . . . , a p ] is a cycle 
of period p, then £if op ix)\ x=ai = f'ia x ) f'(a 2 ) . . . f'ia p ). Assure yourself that the 
situation is correctly summarized in the web diagram shown in Figure IV. 1 17. 

5.9. Consider the dynamical system {[0, 1]; fix)} where 
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\\-2x when* e [0, \], 
/(jc) = | 2 jc — 1 when* e {\, 1]. 

Consider also the just-touching IFS {[0, 1], \x + \, -\x + \). Show that it is possi- 
ble to define a "shift transformation," 5, on the attractor, A, of this IFS in such a way 
that {[0, 1]; 5} and {[0, 1]; /(*)} are equivalent dynamical systems. To do this you 
should define 5 : A -+ A in the obvious manner for points with unique addresses; 
and you should make a suitable definition for the action of 5 on points with multiple 
addresses. 

5.10. Let {R 2 ; w u w 2 , w 3 ] denote a one-parameter family of IFS, where 



w 3 



GMT A)G)= 
-GMT ,WGMT)- 
GMT ,«GM?) '"" ,0 " 



Let the attractor of this IFS be denoted by A(p). Show that A(0) is a Cantor set 
and A(l) is a Sierpinski triangle. Consider the associated family of code space maps 
4>{p) : E A(p). Show that 0(p)(tr) is continuous in /> for fixed a e E; that is 
0(P)(<O : [0, 1] ^ is a continuous path. Draw some of these paths, including 
ones that meet at p = 1. Interpret these observations in terms of the Cantor set be- 
coming "joined to itself" at various points to make a Sierpinski triangle, as suggested 
in Figure IV. 118. . . 

Since the IFS is totally disconnected when p = 0, 0(p = 0) : £ A(0) is m- 
vertible. Hence we can define a continuous transformation 9 : A(0) A(l) by 
0 (jc) = 0 (p = \){(j)-\p = 0)(jc)). Show that if we define a set J(x) = {y e A(0) : 
= jc} for each jc e A(l), then J(x) is the set of points in A(0) whose associated 
paths meet at x e A(l) when p = 1. Invent shift dynamics on paths. 



6 The Shadow of Deterministic Dynamics 

Our goal in this section is to extend the definition of the shift dynamical system 
associated with a totally disconnected hyperbolic IFS to cover the just-touching and 
overlapping cases. This will lead us to the idea of a random shift dynamical system 
and to the discovery of a beautiful theorem. This theorem will be called the Shadow 

Theorem. * 

Let {X- wi w 2 , • ..,%} denote a hyperbolic IFS, and let A denote its attractor. 
Assume that w n : A A is invertible for each n = 1, 2, . . . , N, but that the IFS is 
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Figure IV. 11 8. Con- 
tinuous transformation of 
a Cantor set into a Sierpin- 
ski triangle. The inverse 
transformation would in- 
volve some ripping. 
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Figure IV. 11 9. The 

two possible shift dynam- 
ical systems associated 
with the just-touching IFS 
{[0, 1]; \x, \x + {} are 
represented by the two 
possible graphs of S(x). 
"Most" orbits are unaf- 
fected by the difference 



0 10 1 between the two systems. 



not totally disconnected. We want to define a dynamical system {A; S} analogous to 
the shift dynamical system defined earlier. Clearly, we should define 

S(x) = w~ l (x) when* e w n (A), but x & w m (A) form/n, 

for each n = 1,2, . . . , N. 

However, at least one of the intersections w m (A) n w n (A) is nonempty for some 
m^n. One idea is simply to make an assignment of which inverse map is to be 
applied in the overlapping region. For the case iV = 2we might define, for example, 

| w~ l (x) when x e 
X ~~\w2 l (x) when* e A \ w^A). 

In the just-touching case the assignment of where S takes points that lie in the 
overlapping regions does not play a very important role: only a relatively small 
proportion of points will have somewhat arbitrarily specified orbits. We look at some 
examples, just to get the flavor. 

Examples & Exercises 

6. 1 . Consider the shift dynamical systems associated with the IFS 

1 1 l , 
{[0,1];-*,-* + ^ 

We have S(x) = 2x for x e [0, ±) and S(x) = 2x - 1 for x e (±, 1]. We can define 
the value of S(\) to be either 1 or 0. The two possible graphs for S(x) are shown in 
Figure IV. 1 19. The only points xe[0,l] = A whose orbits are affected by the defi- 
nition are those rational numbers whose binary expansions end ... 01 1 1 or .. . 1000, 
the dyadic rationals. 

6.2. Show that if we follow the ideas introduced above, there is only one dynami- 
cal system {A; S} that can be associated with the just-touching IFS {[0, 1]; -\x + 
\, \x}. The key here ft that w~ l (x) = w^ x (x) for all x e wi(A) n w 2 (A). 

6.3. Consider some possible "shift" dynamical systems {A; S} that can be associ- 
ated with the IFS 
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In what ways are they 
alike? 



IFS {[0,1]; \x,\x + \}. 



Figure IV. 120. Two 

possible shift dynamical 
systems that can be associ 
ated with the overlapping 



1 




0 



1 



0 



1 




The attractor, A, is overlapping at the three points a = w\(A) D wi(A), b = 
wi(A) fl u>3(A), and c = w 3 (A) D wi(A). We might define S(a) = w^[ l (a) or 
W2 l {a), S(b) = u>2 l (b) or w^ l (b), and S(c) — w^(c) or w^(c). Show that re- 
gardless of which definition is made, the orbits of a, b, and c are eventually periodic. 

6.4. Consider a just-touching IFS of the form {IR 2 ; w\, W2, W3} whose attractor is 
an equilateral Sierpinski triangle A. Assume that each of the maps is a simili- 
tude of scaling factor 0.5. Consider the possibility that each map involves a rota- 
tion through 0°, 120°, or 240°. The attractor, A, is overlapping at the three points 
a = w\(A) fl 102(A), b = 102(A) fl 103(A), and c = wt,(A) fl w\(A). Show that it 
is possible to choose the maps so that w^ l (a) = u>2 l (a), u>2~ l (b) = w^ l (b), and 
w~ l (c) = w^(c). 

6.5. Is code space on two symbols topologically equivalent to code space on three 
symbols? Yes! Construct a homeomorphism that establishes this equivalence. 

6.6. Consider the hyperbolic IFS {E; t\, t2, ■ ■ ■ , t^}, where £ is code space on N 
symbols [\,2, . . . , N} and 



Show that the associated shift dynamical system is exactly {E; T) defined in Theo- 
rem 4.5.1. Can two such shift dynamical systems be equivalent for different values 
of N? To answer this question consider how many fixed points the dynamical system 
{E; T] possesses for different values of N . 

6.7. Consider the overlapping hyperbolic IFS {[0, 1]; \x, \x + \). Compare the 
two associated shift dynamical systems whose graphs are shown in Figure IV. 120. 
What features do they share in common? 

6.8. Demonstrate that code space on two symbols is not metrically equivalent to 
code space on three symbols. 

In considering exercises such as 6.7, where two different dynamical systems are 



t n a = no 



for all a e E . 
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OVERLAPPING 
NON-OVERLAPPING 



NON-OVERLAPPING 




Figure IV. 121. A par- 
tially random and par- 
tially deterministic shift 
dynamical system as- 
sociated with the IFS 
{[0, l];±x, fx + J}. 



UNIQUE 
DYNAMICS 



RANDOM 
DYNAMICS 



UNIQUE 
DYNAMICS 



associated with an IFS in the overlapping case, we are tempted to entertain the idea 
that no particular definition of the shift dynamics in the overlapping regions is to 
be preferred. This suggests that we define the dynamics in overlapping regions in 
a somewhat random manner. Whenever a point on an orbit lands in an overlapping 
region we should allow the possibility that the next point on the orbit is obtained by 
applying any one of the available inverse transformations. This idea is illustrated in 
Figure IV. 121, which should be compared with Figure IV. 120. 

Definition 6.1 Let {X; w u w 2 ) be a hyperbolic IFS. Let A denote the attractor 
of the IFS. Assume that both w x : A -> A and w 2 : A^ A are invertible. A sequence 
of points {x n }™ = 0 in A is called an orbit of the random shift dynamical system 
associated with the IFS if 

w~ l (x n ) whenx„ e w\(A) and x n &w\(A) f) w 2 (A), 

jc„+i = < W2 l (x n ) when x n e w 2 (A) and x n & w\(A) D w 2 {A), 

one of {w~\x n ), w^ixn)} when x n e wi(A) H w 2 (A), 

for each n £ {0, 1,2,.. .}. We will use the notation x n+l = S(x n ) although there may 
be no well-defined transformation S : A -> A that makes this true. Also we will write 
{A; S} to denote the collection of possible orbits defined here, and we will call {A; S} 
the random shift dynamical system associated with the IFS. 
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Notice that if wi(A) n w 2 (A) = 0 then the IFS is totally disconnected and the 
orbits defined here are simply those of the shift dynamical system {A; S} defined 
earlier. 

We now show that there is a completely deterministic dynamical system acting 
on a higher-dimensional space, whose projection into the original space X yields the 
"random dynamics" we have just described. Our random dynamics are seen as the 
shadow of deterministic dynamics. To achieve this we turn the IFS into a totally 
disconnected system by introducing an additional variable. To keep the notation 
succinct we restrict the following discussion to IFS's of two maps. 

Definition 6.2 The lifted IFS associated with a hyperbolic IFS {X; w u w 2 } is 

the hyperbolic IFS {X x E; w\, w 2 ], where E is the code space on two symbols 
{1,2} ,and 

w l (x,a) = (w l (x), la) for all (x, a) e X x E; 
w 2 ( x , a) = (w 2 (x), 2a) for all (x, a) e X x E. 

What is the nature of the attractor A c X x E of the lifted IFS? It should be clear 
that 

A = {x e A : (x, a) e A} and E = {a e E : (x, a) e A}. 

In other words, the projection of the attractor of the lifted IFS into the original space 
X is simply the attractor A of the original IFS. The projection of A into E is E. 
Recall that E is equivalent to a classical Cantor set. This tells us that the attractor of 
the lifted IFS is totally disconnected. 

Lemma 6.1 Let {X; w\, w 2 ] be a hyperbolic IFS with attractor A. Let the two 
transformations w x : A-*- A and w 2 : A A be invertible. Then the associated 
lifted IFS is hyperbolic and totally disconnected. 

Definition 6.3 Let{X;w u w 2 } be a hyperbolic IFS. Let the two transformations 
tyj : A A and w 2 : A Abe invertible. Let A denote the attractor of the associ- 
ated lifted IFS. Then the shift dynamical system {A; 5} associated with the lifted IFS 
is called the lifted shift dynamical system associated with the IFS. 

Notice that 

S(x, a) = (w-\x), T(a)) for all(X, a) e A, 

where 

T(a\a 2 aT,a4 . . .) = a 2 a3a 4 a 5 ... for all a = a\a 2 a3,a 4 e E. 

Theorem 6. 1 [(The Shadow Theorem).] Let {X; w u w 2 ] be a hyperbolic IFS of 
invertible transformations w x and w 2 and attractor A. Let {x n }%L Q be any orbit of the 
associated random shift dynamical system {A; S} . Then there is an orbit {x n }™ =Q of 
the lifted dynamical system {A; 5} such that the first component of x n isx n foralln. 
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LIGHT 



SEEN 
FROM 
THE 
SIDE 
THE 
SET 
IS 

TOTALLY 
DISCONNECTED 




X 



Cantor set of 
infinitesimal 

leaflets 

grouped 

in fours 



Each "leaflet" 
<~^> is a microcosm 
of the whole 
leaflet stack 



Figure IV. 122. The lift 
of the overlapping leaf 
attractor is totally dis- 
connected. Deterministic 
shift dynamics become 
possible. See also Fig- 
ure IV. 123. 



THE SHADOW OF THE CANTOR SET 
IS A LEAF, THE ATTRACTOR OF AN IFS. 



We leave the proofs of Lemma 6.1 and Theorem 6.1 as exercises. It is fun, how- 
ever, and instructive to look in a couple of different geometrical ways at what is 
going on here. 

Examples & Exercises 

6.9. Consider the IFS {C; w\(z), w 2 (z), W3(z), w 4 (z)} where, in complex notation, 
w x {z) = (0.5)(cos45° - V^Tsin45°)z + (0.4 - 0.2V^T), 
w 2 (z) = (0.5)(cos45° + V^Tsin45°)z - (0.4 + 0.2V-1), 

w 3 (z) = (0.5)z + 7^1(0.3), 

w 4 (z) = (0.5)z - 7^(0.3). 

A sketch of its attractor is included in Figure IV. 122. It looks like a maple leaf. The 
leaf is made of four overlapping leaflets, which we think of as separate entities, at 
different heights "above" the attractor. In turn, we think of each leaflet as consisting 
of four smaller leaflets, again at different heights. One quickly gets the idea: one 
ends up with a set of heights distributed on a Cantor set in such a way that the 
shadow of the whole collection of infinitesimal leaflets is the leaf attractor in the 
C plane. The Cantor set is essentially S. The lifted attractor is totally disconnected; 
it supports deterministic shift dynamics, as illustrated in Figure IV. 123. 
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Figure IV. 123. A pic- 
ture of the Shadow The- 
orem. Deterministic dy- 
namics on a totally discon- 
nected dust has a shadow 
that is dancing random 
shift dynamics on a leaf 
attractor. 
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DETERMINISTIC 
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6.10. Consider the overlapping hyperbolic IFS {R; \x, \x + \}. We can lift this to 
the hyperbolic IFS {R 2 ; wi(x), w 2 (x)}, where 

-(:)-(! DG)< 
-CMS PfeMI) 

The attractor A of this lifted system is shown in Figure IV. 124, which also 
shows an orbit of the associated shift dynamical system. The shadow of this orbit 
is an apparently random orbit of the original system. The Shadow Theorem asserts 
that any orbit {x n }™ =0 of a random shift dynamical system associated with the IFS 
{R; \x, \x + \ } is the projection, or shadow, of some orbit for the shift dynamical 
system associated with the lifted IFS. 

6. 1 1 . As a compelling illustration of the Shadow Theorem, consider the IFS 

1 3 1 
(R; 2*.4* + 4>- 

Let us look at the orbits {x n }%L 0 of the shift dynamical system specified in the 
left-hand graph of Figure IV.120. In this case we always choose S(x) = w^ix) 
in the overlapping region. What orbits {x n }™ = o of the lifted system, described in 
exercise 6.7, are these orbits the shadows of? Look again at Figure IV. 124! Define 
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Figure IV. 124. The 

Shadow Theorem asserts 
that the random shift 
dynamical system orbit on 
the overlapping attractor 
A is the shadow of a 
deterministic orbit on A. 



X— ► 



the top of A as 

A top = [(x, y) € A : (z, y) € A z < x , and y € [0, 1]}. 

Notice that 5 : A top -» A top . It is easy to see that there is a one-to-one correspon- 
dence between orbits of the lifted system {A top ; S} and orbits of the original system 
specified through the left-hand graph of Figure IV. 120. Indeed, 

{(x n , y n )}%L 0 is an orbit of the lifted system and (x 0 , y 0 ) € A top 
■$ 

M%Lo is an orbit of the left-hand graph of Figure IV. 120 

6.1 2. Draw some pictures to illustrate the Shadow Theorem in the case of the just- 
touching IFS {[0, 1]; \x, \x + \Y 

6.13. Illustrate the Shadow Theorem using the overlapping IFS {[0, 1]; -\x + 
+ Find an orbit of period 2 whose lift has minimal period 4. Do there exist 

periodic orbits whose lifts are not periodic? 

6.14. Prove Lemma 6.1. 



6. 1 5. Prove Theorem 6. 1 . 
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6.16. The IFS {E; u>\(a), . . . , wm(ct)} given by 

w n (a) = no 

for each n = 1, 2, . . . , N, has an interesting lift. Show that the lift cf this IFS, with 
a suitably defined inverse, is the shift automorphism on the space of shifts and 
therefore equivalent to the baker's transformation. 

6. 1 7. In section 5 it was shown that the associated shift dynamical system of any 
totally disconnected IFS is equivalent to the shift transformation on code space. 
Then we may replace the second map in the lift for the Shadow Theorem with such 
a totally disconnected IFS. That is, we could take a map like the leaf shown in 
Figures IV. 122 and IV. 123, and define the map 

{[R 2 x A; w\(x, y), . . . w 4 (x, y)}, 

where it), = (wj~ l (x, y), V{(x, y)), where v t are the maps of the totally disconnected 
IFS 
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Since this IFS produces an attractor that is totally disconnected, and therefore a copy 
of code space, the resulting lift is totally disconnected. What would a rendition of the 
lifted system look like if the maple leaf were lifted using a totally disconnected tree? 



7 The Meaningfulness of Inaccurately Computed Orbits Is Established by 
Means of a Shadowing Theorem 

Let {X; w\,W2, w N ) be a hyperbolic IFS of contractivity 0 < s < 1. Let A de- 
note the attractor of the IFS, and assume that w n : A — > A is invertible for each 
n = 1, 2, . . . , N . If the IFS is totally disconnected, let {A; S) denote the associated 
shift dynamical system; otherwise let {A; S} denote the associated random shift dy- 
namical system. Consider the following model for the inaccurate calculation of an 
orbit of a point x 0 e A. This model will surely describe the reader's experiences 
in computing shift dynamics directly on pictures of fractals. Moreover, it is a rea- 
sonable model for the occurrence of numerical errors when machine computation is 
used to compute an orbit. 
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Let an exact orbit of the point x 0 e A be denoted by {x n }™ =0 , where x n = S on (x 0 ) 
for each n. Let an approximate orbit of the point x 0 e A be denoted by {*n}£L 0 where 
jco = x 0 . Then we suppose that at each step there is made an error of at most 6 for 
some 0 < 6 < oo; that is, 

d(x n+ i,S(x n ))<e fom = 0, 1,2, .... 

We proceed to analyze this model. It is clear that the inaccurate orbit {x n }™ =0 will 
usually start out by diverging from the exact orbit {x n }™ =0 at an exponential rate. 
It may well occur "accidentally" that d(x n ,x n ) is small for various large values 
of n, due to the compactness of A. But typically, if d(x n ,x n ) is small enough, 
then d(x n+ j, x n+j ) will again grow exponentially with increasing j. To be precise, 
suppose d(x u S(x 0 )) = 6 and that we make no further errors. Suppose also that for 
some integer M, and some integers o\ , a 2 , . . . , &m e { 1 , 2, . . . , TV}, we have 

x n and x n e w an (A), for n = 0, 1, 2, . . . , M. 

Moreover, suppose that 

jc„+i = w~*{x n ) and x n+1 = w~*(x n ), for n = 0, 1, 2, . . . , M. 

Then we have 

d(x n+l , x n+ i) > s~ n 9, for n = 0, 1, 2, . . . , M. 

For some integer 7 > M it is likely to be the case that 

xj+i = w~j(x n ) and x n+ i = w^(x n ), for some oj # &j. 

Then, without further assumptions, we cannot say anything more about the correla- 
tion between the exact orbit and the approximate orbit. Of course, we always have 
the error bound 

d(x n ,x n ) < diam(A) = max{d(>, y):xeA,ye A}, for all n = 1, 2, 3, ... . 

Do the above comments make the situation hopeless? Are all of the calculations 
of shift dynamics we have done in this chapter without point because they are riddled 
with errors? No! The following wonderful theorem tells us that however many errors 
we make, there is an exact orbit that lies at every step within a small distance of our 
errorful one. This orbit shadows the errorful orbit. This type of theorem is extremely 
important in dynamics, and in any class of dynamical systems that has one (such as 
IFS) behavior that can be accurately analyzed using graphics on computers. Here we 
are use the word "shadows" in the sense of a secret agent who shadows a spy. The 
agent is always just out of sight, not too far away, usually not too close, but forever 
he follows the spy. 

Theorem 7.1 The Shadowing Theorem. Let {X; w u w 2 ,..., w N } be a hy- 
perbolic IFS of contractility s, where 0 < s < 1. Let A denote the attractor of the 
IFS and suppose that each of the transformations w n : A -> A is invertible. Let 
{A; S} denote the associated shift dynamical system in the case that the IFS is totally 
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disconnected; otherwise let {A; S} denote the associated random shift dynamical 
system. Let {x n }^L 0 C A be an approximate orbit of S, such that 

d(x n+1 , S{x n )) <0 for all n = 0, 1, 2, 3, ... , 

for some fixed constant 6 with 0 < 6 < diam(A). Then there is an exact orbit {x n = 
S on (xo)}^L 0 for some xq g A, such that 

sO 

d(x n+u x n+l ) < for all n = 0, 1, 2, ... . 

(1 -s) 

Proof As usual we exploit code space! For n = l,2, 3, . . ., let a n e {1, 
2, . . . , N) be chosen so that ly" 1 , iy~ ! , ly" 1 , . . ., is the actual sequence of inverse 
maps used to compute S.(xo), 5(Jci), S(x2), — Let 0 : E — >• A denote the code space 
map associated with the IFS. Then define 

x 0 = (p(aia 2 (T3 . . .). 

Then we compare the exact orbit of the point xq, 

{x n = S on (x 0 ) = <f>((r n+l a n+2 . . .)}~ o 

with the errorful orbit {x n }™ =0 . 

Let M be a large positive integer. Then, since x M and S(icM-i) both belong to A, 
we have 

J(5(jcm-i), S(xm-\) < diam(A) < cx). 

Since S(xm-i) and 5(jcm-i) are both computed with the same inverse map iy~j it 
follows that 

d(xM-\, xm-\) < ^ diam(A). 

Hence 

d(S(x M -2), S(x M -2)) = d{x M -\, S{x M -2)) 

< d(XM-l, xm-\) + d(XM-\, S(xm-2)) 

< 0 + sdiam(A); 

and repeating the argument used above we now find 

d(x M -2, xm-i)) <s(0 +s diam(A)). 
Repeating the same argument k times we arrive at 

d(x M -k, x M -k) <s6 +s 2 6 -\ h s k ~ l 9 + s k diam(A). 

Hence for any positive integer M and any integer n such that 0 < n < M, we have 

d(x n , x n ) <s0 +s 2 0 + ■■■+ s M - n ~ 1 0 + s M ~ n diam(A). 

Now take the limit of both sides of this equation as M — >• oo to obtain 

sO 

d(x n , x n ) < s0(l + s + s 2 H ) = — -, for all n = 1, 2, ... . 

(1 - s) 

This completes the proof. 
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Examples & Exercises 

7.1. Let us apply the Shadowing Theorem to an orbit on the Sierpinski triangle, 
using the random shift dynamical system associated with the IFS 

11 11 i 
(C; 5 z, 5 z + -,-z + 5 ). 

Since the system is just-touching we must assign values to the shift transformation 
applied to the just-touching points. We do this by defining 

S(x\ + ix 2 ) = 2*i mod 1 + i(2x 2 mod 1). 

We consider the orbit of the point * 0 = (0.2147, 0.0353). We compute the first 11 
points on the exact orbit of this point, and compare it to the results obtained when a 
deliberate error 6 = 0.0001 is introduced at each step. We obtain: 





Errorful 




Exact 


Xo = 


(0.2147,0.0353) 


S o0 (*o) = 


(0.2147,0.0353) 


X\ = 


(0.4295,0.0705) 


S ol (*o) = 


(0.4294, 0.0706) 


x 2 = 


(0.8591,0.1409) 


S o2 (x 0 ) = 


(0.8588,0.1412) 


*3 = 


(0.7183,0.2817) 


S°\x 0 ) = 


(0.7176,0.2824) 


JC4 = 


(0.4365, 0.5635) 


S oA (x 0 ) = 


(0.4352, 0.5648) 


*5 = 


(0.8731,0.1269) 


S o5 (*o) = 


(0.8704,0.1296) 


*6 = 


(0.7463,0.2537) 


S° 6 (xo) = 


(0.7408, 0.2592) 


X 7 = 


(0.4927, 0.5073) 


S ol (x 0 ) = 


(0.4816,0.5184) 


*8 = 


(0.9855,0.0145) 


S°\x 0 ) = 


(0.9632, 0.0368) 


Xg — 


(0.9711,0.0289) 


S o9 (*o) = 


(0.9264, 0.0736) 


*10 — 


: (0.9423,0.0577) 


5 ol0 (x 0 ) = 


: (0.8528,0.1472) 



Notice how the orbit with errors diverges from the exact orbit of Nonetheless, 
the shadowing theorem asserts that there is an exact orbit {x n } such that 

\ 

d(x n , x n ) < —^-j- (0.0001) = 0.0001, 
1 ~ 2 

where d(-, •) denotes the Manhattan metric. This really seems unlikely; but it must 
be true! Here's an example of such a shadowing orbit, also computed exactly. 





Exact Shadowing Orbit x n = S° n (x 0 ) 


d(x n ,x n ) < 0.0001 


Xo - 


= (0.21478740234375, 0.03521259765625) 


0.00009 


Xl 


= (0.4295748046875, 0.0704251953125) 


0.00008 


x 2 


= (0.8591496093750, 0.1408503906250) 


0.00005 


*3 


= (0.7182992187500, 0.2817007812500) 


0.000001 


JC4 


= (0.4365984375000, 0.5634015625000) 


0.0001 


*5 


= (0.8731968750000, 0.1268031250000) 


0.0001 


-^6 


= (0.7463937500000,^.2536062500000) 


0.0001 


Xl 


= (0.4927875000000, 0.5072125000000) 


0.00009 


x% 


= (0.9855750000000, 0.0144250000000) 


0.00008 
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x 9 = (0.97 1 1 500000000, 0.0288500000000) 0.00005 
jc 10 = (0.9423000000000, 0.0577000000000) 0.000000 

Figure IV. 125 illustrates the idea. 

7.2. Consider the shift dynamical system {E; T} on the code space of two symbols 
{1,2}. Show that the sequence of points {x n } given by 

jc 0 = 212, and x n = 12 for all n = 1, 2, 3, . . . 

is an errorful orbit for the system. Illustrate the divergence of T on x 0 from x n . Find 
a shadowing orbit [x n }%L 0 and verif y the error estimate provided by the Shadowing 
Theorem. 

7.3. Illustrate the Shadowing Theorem by constructing an erroneous orbit, and an 
orbit that shadows it, for the shift dynamical system {[0, 1]; \x, \x + \ ). 

7.4. Compute an orbit for a random shift dynamical system associated with the 
overlapping IFS {[0, 1]; \x, \x + \). 

7.5. An orbit of the shift dynamical system associated with the IFS 

N(;)U;K(:)H;) + GH(;M;)l- 
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is computed to accuracy 0.0005. How close a shadowing orbit does there exist? Use 
the Manhattan metric. 

7.6. In Figure IV. 126 an orbit of the random shift dynamical system associated with 
the overlapping IFS {[0, 1], w\(x), w 2 (x)} is computed by drawing a web diagram. 
The computer in this case consists of a pencil and a drafting table. Estimate the 
errors in the drawing and then deduce how closely an exact orbit shadows the plotted 
one. You will need to estimate the contractivity of the IFS. Also draw a tube around 
the plotted orbit, within which an exact orbit lies. 

7.7. Figure IV. 127 shows an orbit {x n } of the random shift dynamical system associ- 
ated with the IFS {[0, 1]; w x (x), w 2 (x)}. It was obtained by defining S(x) = w^(x) 
for x e w\(A) n w 2 (A). A contractivity factor for the IFS is readily estimated from 
the drawing to be | . Hence if the web diagram is accurate to within 1 mm at each 
iteration, that is 

d(x n+ i, S(x„)) < 1 mm, 
then there is an exact orbit [x n = S on (x 0 )}^L 0 such that 

(2) 

d(x n ,x n ) < -j- = 1.5mm. 

Thus there is an actual orbit that remains within the "orbit tube" shown in Fig- 
ure IV. 127. 
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8 Chaotic Dynamics on Fractals 

The shift dynamica! system (A; S) associated with a totally disconnected hyperbolic 
WS is equivalent to the shift dynamica. system CS. D, where S ,s the code : spaa 
associated with the IPS. As we have seen, this equivalence means tha the two 
"Is have a number of properties in common; for example the two sys ems have 
I same number of cycles of minimal period 7. A particularly tmportan, propert 
Lt they share is ma, they are both "chaotic" dynamical systems, a concept tha, we 
explain in ,his section. Firs,, however, we wan. to underline that the two systems are 
deeply different from the point of view of the interplay of then- dynam.cs w«h the 
geometry of the underlying spaces. 

Consider the case of an IFS of three transformations. Let E denote the code space 
of the three symbols ( 1 , 2, 3}, and look at the orbit of the point a e E given by 



a = 



1231112132122233132331111 

12113121122123131132133211212213 

22122222323123223331131231332132 

232333133233311111H211131121112 

21123113111321133121112121213122 

11222122312311232123313 

1113121212 FOREVER. 



8 Chaotic Dynamics on Fractals 1 




start of a chaotic orbit on 
Ternary Cantor set. 



Figure IV. 128. The 



This orbit {T on a}™ =0 may be plotted on a Cantor set of three symbols, as sketched 
in Figure IV. 128. This can be compared with the orbit {S on ((f)(a))}^L 0 of the shift 
dynamical system {A, S} associated with an IFS of three maps, as plotted in Fig- 
ure IV. 129. Figure IV. 130 shows an equivalent orbit, but this time for the just- 
touching IFS {[0, 1]; |x, 5* + s> + §}. displayed using a web diagram. 

In each case the "same" dynamics look entirely different. The qualities of beauty 
and harmony present in the observed orbits are different. This is not suprising: the 
equivalence of the dynamical systems is a topological equivalence. It does not pro- 
vide much information about the interplay of the dynamics with the geometries of 
the spaces on which they act. This interplay is an open area for research. For ex- 
ample, what are the special conserved properties of two metrically equivalent dy- 
namical systems? Can you quantify the grace and delicacy of a dancing orbit on a 
fractal? 

This said, we turn our attention back to an important collection of properties 
shared by all shift dynamical systems. For simplicity we formalize the discussion for 
the case of the shift dynamical system {A, S} associated with a totally disconnected 
hyperbolic IFS. 

Definition 8. 1 Let (X, d) be a metric space. A subset B cXis said to be dense 
in X if the closure of B equals X. A sequence {x n }^L 0 of points in X is said to be 
dense in X if for each point a e X, there is an subsequence {x an }^ =0 that converges 
to a. In particular an orbit {x n }™ =0 of a dynamical system {X, /} is said to be dense 
in X if the sequence {x n }™ =0 is dense in X. 

By now you will have had some experience with using the random iteration al- 
gorithm, Program 2 of Chapter III, for computing images of the attractor A of IFS 
in R 2 . If you run the algorithm starting from a point x 0 e A, then all of the com- 
puted points lie on A. Apparently, the sequences of points we plot are examples of 
sequences that are dense in the metric space (A,d). 

The property of being dense is invariant under homeomorphism : if B is dense in 
a metric space (X, d) and if 9 : X Y is a homeomorphism, then 6(B) is dense in 
Y. If {X; /} and {Y, g} are equivalent dynamical systems under 0; and if {x n } is an 
orbit of / dense in X, then {0(x n )} is an orbit of g dense in Y. 
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Figure IV. 130. Equiv- 
alent orbit to the one 
in Figures IV. 128 and 
IV. 129, this time ploted 
using a web diagram. The 
starting point has address 
12311121321222331 . . .. 
This manifestation of an 
orbit, which goes arbi- 
trarily close to any point, 
takes place on a just- 
touching attractor. 
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Definition 8.2 A dynamical system {X, /} is transitive if, whenever U and V are 
open subsets of the metric space (X,d), there exists a finite integer n such that 

U n / on (V) # 0. 

The dynamical system {[0, 1]; f(x) = min{2*, 2 - 2x}} is topological^ transi- 
tive. To verify this just let U and V be any pair of open intervals in the metric 
space ([0, 1], Euclidean). Clearly, each application of the transformation increases 
the length of the interval U in such a way that it eventually overlaps V. 

Definition 8.3 The dynamical system {X; /} is sensitive to initial conditions if 
there exists 8 > 0 such that, for any x € X and any ball B(x,e) with radius e > 0, 
there is y e B(x, e) and an integer n > 0 such that d(f on (x), f on (y)) > 8. 

Roughly, orbits that begin close together get pushed apart by the action of the 
dynamical system. For example, the dynamical system {[0, 1]; 2x mod 1} is sensitive 
to initial conditions. 

Examples & Exercises 

8.1 . Show that the rational numbers are dense in the metric space (R, Euclidean). 

8.2. Let C(n) be a counting function that counts all of the rational numbers that lie 
in the interval [0,1]. Let r C («> denote the nth rational number in [0,1]. Prove that the 
sequence of real numbers {r CM € [0, 1] : n = 1, 2, 3, ...} is dense in the metric space 
([0, 1], Euclidean). 

8.3. Consider the dynamical system {[0, 1]; fix) = 2x mod 1}. Find a point x Q e 
[0, 1] whose orbit is dense in [0, 1]. 

8.4. Show that the dynamical system {[0, oo) : fix) = 2x} is sensitive to initial 
conditions, but that the dynamical system {[0, oo) : fix) = (0.5)*} is not. 

8.5. Show that the shift dynamical system {E; T), where E is the code space of two 
symbols, is transitive and sensitive to initial conditions. 

8.6. Let {X, /} and {Y, g} be equivalent dynamical systems. Show that {X, /} is 
transitive if and only if {Y, g} is transitive. In other words, the property of being 
transitive is preserved between equivalent dynamical systems. 

Definition 8.4 A dynamical system {X, /} is chaotic if 

(1) it is transitive; 

(2 ) it is sensitive to initial conditions; 

(3) the set of periodic orbits off is dense in X. 

Theorem 8. 1 The shift dynamical system associated with a totally disconnected 
hyperbolic IFS of two or more transformations is chaotic. 

Sketch of Proof: First one establishes that the shift dynamical system {E; T) is 
chaotic where E is the code space of N symbols, with N > 2. One then uses the code 
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space map 0 : E -> A to carry the results over to the equivalent dynamical system 
{A; S). 

Theorem 1 applies to the lifted IFS associated with a hyperbolic IFS. Hence the 
lifted shift dynamical system associated with an IFS of two or more transformations 
is chaotic. In turn this implies certain characteristics to the behavior of the projection 
of a lifted shift dynamical system, namely a random shift dynamical system. 

Let us consider now why the random iteration algorithm works, from an intuitive 
point of view. Consider the hyperbolic IFS {R 2 ; w u w 2 }. Let a e A; suppose that the 
address of a is a e E, the associated code space. That is 

a = (p(a). 

With the aid of a random-number generator, a sequence of one million ones and 
twos is selected. For example, suppose that the the actual sequence produced is the 
following one, which has been written from right to left, 

21 ... 12121121121211121112111111211211121111211212122211 

By this we mean that the first number chosen is a 1, then a 1, then three 2's, and so 
on. Then the following sequence of points on the attractor is computed: 

a = (p(a) 

wi(a) = 0(l<r) 

w\ o w\(a) = 0(1 Icr) 

w 2 o w\ o w\(a) = 0(21 lcr) 

w 2 ow 2 ow\o w\(a) — 0(221 la) 

w 2 ow 2 ow 2 ow\o w\(a) — 0(2221 la) 

w\ o w 2 o w 2 o w 2 o u>\ o w\(a) = 0(122211cr) 

w 2 o w\ o w 2 o w 2 o w 2 o w\ o w\(a) = 0(212221 la) 

w\ o w 2 o w\ o w 2 o w 2 o w 2 o wi o w\(a) = 0(12122211a-) 

w 2 o w\ o w 2 o w\ o w 2 o w 2 o w 2 o w\ o w\(a) = 0(21212221 la) 

w\ o w 2 o w\ o w 2 o w\ o w 2 o w 2 o w 2 o w\ o w\(a) = 0(121212221 la) 

w\ o w\ o w 2 o w\ o w 2 o wi o w 2 o w 2 o w 2 o o u;i(a) = 0(1 121212221 lcr) 

«; 2 ou;io..,iijiou;ioi02 0 ii; 1 oM; 2 ou;iou)2 0u)2 0!02 0 «; 1 o«;i(o) = 0(21 . . . 11212122211a) 
We imagine that instead of plotting the points as they are computed, we keep a list of 
the one million computed points. This done, we plot the points in the reverse order 
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from the order in which they were computed. That is, we begin by plotting the point 
0(21... 11212122211a) and we finish by plotting the point 0(a). What will we 
see? We will see a million points on the orbit of the shift dynamical system {A; S}; 
namely, {5 on (0 (21 ... 1 121212221 la))}^' 000 . 

Now from our experience with shift dynamics and from our theoretical knowl- 
edge and intuitions what do we expect of such an orbit? We expect it to be chaotic 
and to visit a widely distributed collection of points on the attractor. We are looking 
at part of a "randomly chosen" orbit of the shift dynamical system; we expect it to 
be dense in the attractor. 

For example, suppose that you are doing shift dynamics on a picture of a totally 
disconnected fractal, or a fern. You should be convinced that by making sly adjust- 
ments in the orbit at each step, as in the Shadowing Theorem, you can most easily 
coerce an orbit into visiting, to within a distance e > 0, each point in the image. But 
then the Shadowing Theorem ensures that there is an actual orbit close to our artifi- 
cial one, and it too goes close to every point on the fractal, say to within a distance of 
2e of each point on the image. This suggests that "most" orbits of the shift dynamical 
system are dense in the attractor. 

Examples & Exercises 

8.7. Make experiments on a picture of the attractor of a totally disconnected hyper- 
bolic IFS to verify the assertion in the last paragraph that "by making sly adjustments 
in an orbit ... you can most easily coerce the orbit into visiting to within a distance 
€ > 0 of each point in the image." Can you make some experimental estimates of 
how many orbits go to within a distance e > 0, for several values of e, of every point 
in the picture? One way to do this might be to work with a discretized image and to 
try to count the number of available orbits. 

8.8. Run the Random Iteration Algorithm, Program 2 in Chapter III, to produce an 
image of a fractal, for example a fern without a stem as used in Figure IV. 129. As 
the points are calculated and plotted, keep a list of them. Then plot the points over 
again in reverse order, this time making them flash on and off on the picture of the 
attractor on the screen, so that you can see where they land. This way you will see 
the interplay of the geometry with the shift dynamics on the attractor. See if the orbit 
is beautiful. If you think that it is, try to make your impression objective. 

We want to begin to formulate the idea that "most" orbits of the shift dynamical 
system associated with a totally disconnected IFS are dense in the attractor. The 
following lemma counts the number of cycles of minimal period p. 

Lemma 8.1 Let {A; S} be the shift dynamical system associated with a totally 
disconnected hyperbolic IFS {X; w\, w 2 , . . . , w N }. Let N{p) denote the number of 
distinct cycles ofminimhl period p,for pe{l,2,3,...}. Then 
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n p _ ^ kAf(k) 

\ ic divides p ) 



fp for p = 1, 2, 3, 



Proof It suffices to restrict attention to code space, and to give the main idea, 
consider only the case N = 2. For p = 1, the cycles of period 1 are the fixed points 
of T. The equation 

To = aa e E 

implies o = TTTT or a = 2222. Thus Af(l) = 2. For p = 2, any point that lies on a 
cycle of period 2 must be a fixed point of T° 2 , namely 

T o2 a = a, 

where a = IT, 12, 2~T, or 22. The only cycles here that are not of minimal period 2 
must have minimal period 1. Furthermore, there are two distinct points on a cycle of 
minimal period 2, so 

Af(2) = (2 2 -Af(l))/2 = 2/2=1. 
One quickly gets the idea. Mathematical induction on p completes the proof for 
N = 2. 

For N = 2, we find, for example, Af(2) = 1, Af(3) = 2, Af(4) = 3, M(5) = 
6, Af(6) = 9, Af(7) = 18, Af(8) = 30, M(9) = 56, Af(lO) = 99, Af(ll) = 186, 
^(12) = 335, Af(13)= 630, AT(14) = 1161, Af(15) = 2182, AT(16) = 4080, 
Af(17) = 7710, Af(18) = 14532, Af(19) = 27594, Af(20) = 52377. In particular, 
99.9% of all points lying on cycles of period 20 lie on cycles of minimal period 20. 

Here is the idea we are getting at. We know that the set of periodic cycles are 
dense in the attractor of a hyperbolic IFS. It follows that we may approximate the 
attractor by the set of all cycles of some finite period, say period 12 billion. Thus we 
replace the attractor A by such an approximation A, which consists of 2 12 000 ' 000 ' 000 
points. Suppose we pick one of these points at random. Then this point is extremely 
likely to lie on a cycle of minimal period 12 billion. Hence the orbit of a point 
chosen "at random" on the approximate attractor A is extremely likely to consist 
of 12 billion distinct points on A. 

In fact one can show that a statistically random sequence of symbols contains 
every possible finite subsequence. So we expect that the set of 12 billion distinct 
points on A is likely to contain at least one representative from each part of the 
attractor! 
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Fractal Dimension 



1 Fractal Dimension 

How big is a fractal? When are two fractals similar to one another in some sense? 
What experimental measurements might we make to tell if two different fractals may 
be metrically equivalent? What is the same about the two fractals in Figure V.131? 

There are various numbers, associated with fractals, which can be used to com- 
pare them. They are generally referred to as fractal dimensions. They are attempts 
to quantify a subjective feeling we have about how densely the fractal occupies the 
metric space in which it lies. Fractal dimensions provide an objective means for 
comparing fractals. 

Fractal dimensions are important because they can be defined in connection with 
real-world data, and they can be measured approximately by means of expenments. 
For example, one can measure the "fractal dimension" of the coastline of Great 
Britain- its value is about 1.2. Fractal dimensions can be attached to clouds, trees, 
coastlines, feathers, networks of neurons in the body, dust in the air at an instant in 
time, the clothes you are wearing, the distribution of frequencies of light reflected by 
a flower, the colors emitted by the sun, and the wrinkled surface of the sea dunng a 
storm. These numbers allow us to compare sets in the real world with the laboratory 
fractals, such as attractors of IFS. 

We restrict attention to compact subsets of metric spaces. This fits well with the 
idea of modelling the real physical world by subsets of metric spaces. Suppose that 
an experimentalist is studying a physical entity, and he wishes to model this entity 
by means of a subset of R 3 . Then he can use a compact set for his model. For 
example, he can assume that the distances he measures are Euclidean distances, 
and he can assume that the universe is bounded. He can assume that any Cauchy 
sequence of points in his model set converges to a point in his model set, because 
he cannot experimentally invalidate this assumption. Although mathematically we 
can distinguish between a set and its closure, we cannot make the same distinction 
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between their physical counterparts. The assumption of compactness will allow the 
model to be handled theoretically with relative ease. 

Let (X, d) denote a complete metric space. Let A e H(X) be a nonempty com- 
pact subset of X. Let e > 0. Let B(x, e) denote the closed ball of radius e and center 
at a point x e X. We wish to define an integer, Af(A, 6), to be the least number of 
closed balls of radius e needed to cover the set A. That is 

Af(A,e)= smallest positive integer M such that Acuf =| ^ )f ), 

for some set of distinct points {x n : n = 1,2, . . . , M) c X. How do we know that 
there is such a number M{A, e)? Easy! The logic is this: surround every point x e A 
by an open ball of radius 6 > 0 to provide a cover of A by open sets. Because A is 
compact this cover possesses a finite subcover, consisting of an integer number, say 
M, of open balls. By taking the closure of each ball, we obtain a cover consisting 
of M closed balls. Let C denote the set of covers of A by at most M closed balls 
of radius e. Then C contains at least one element. Let / : C — ► {1, 2, 3, ... , M) be 
defined by /(c) = number of balls in the cover ceC. Then {/(c) : c e C} is a finite 
set of positive integers. It follows that it contains a least integer, Af(A, e). 

The intuitive idea behind fractal dimension is that a set A has fractal dimension 
Dif: 
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N(A, e) % Ce~ D for some positive constant C. 

Here we use the notation as follows. Let /(e) and g(e) be real valued functions 
of the positive real variable e. Then /(e) « g(e) means that lim e _ 0 {ln(/(O)/ In 
(g(e))} = 1. If we "solve" for D we find that 

_ \nAf(A,e) -InC 
D% ln(l/e) 

We use the notation ln(jc) to denote the logarithm to the base e of the positive real 
number x. Now notice that the term In C/ ln(l /e) approaches 0 as e -> 0. This leads 
us to the following definition. 

Definition 1.1 Let A e H{X) where (X, d) is a metric space. For each e > 0 let 
M(A, e) denote the smallest number of closed balls of radius e > 0 needed to cover 
A. If 



D = ^o{ ln(l/e) } 



exists, then D is called the fractal dimension of A. We will also use the notation 
D = D(A) and will say "A has fractal dimension D." 

Examples & Exercises 

1.1. This example takes place in the metric space (R 2 , Euclidean). Let a e X and 
let A = {a}. A consists of a single point in the space. For each e > 0, M(A, e) = 1. 
It follows that D(A) = 0. 

1.2. This example takes place in the metric space (IR 2 , Manhattan). Let A denote 
the line segment [0, 1]. Let e > 0. Then it is quite easy to see that M(A, e) = 
-[-1/e], where [jc] denotes the integer part of the real number x. In Figure V.132 
we have plotted the graph of ln(Af(A, e)) as a function of ln(l/e). Despite a rough 
start, it appears clear that 

r [ln(Af(A,e))l 1 
SS| ln(l/e) | = L 

In fact for 0 < e < 1 

ln(l/e) ln(— [—!/€]) _ ln(A/"(A,e)) ^ ln(l/e + 1) _ ln(l + e) + ln(l/e) 
lrToTT) - ln(l/e) " ln(l/e) ~ ln(l/e) ln(l/e) 
Both sides here converge to 1 as e ^ 0. Hence the quantity in the middle also 
converges to 1. We conclude that the fractal dimension of a closed line segment is 
one. We would have obtained the same result if we had used the Euclidean metric. 

1 .3. Let (X, d) be a metric space. Let a, b, c e X, and let A = {a, b, c}. Prove that 
D(A) = 0. 1 

The following two theorems simplify the process of calculating the fractal dimen- 
sion. They allow one to replace the continuous variable e by a discrete variable. 
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Figure V.132. Plot of 
ln([ 1 /jc]) as a function of 
ln( 1 /jc). This illustrates 
that in the computation 
of the fractal dimension 
one usually evaluates 
the limiting "slope" of a 
discontinuous function. In 
the present example this 
slope is 1. 



Ln([l/X]) 



1t 



Ln(l/X) 



Theorem 1.1 Let A e H(X), where (X, d) is a metric space. Let € n = Cr n for 
real numbers 0 < r < 1 and C > 0, and integers n = 1,2,3, .... If 

n^oo [ ln(l/e„) 
then A has fractal dimension D. 

Proof Let the real numbers r and C, and the sequence of numbers E = {e„:n = 
1, 2, 3, . . .} be as defined in the statement of the theorem. Define /(e) = max{e„ e 
E : € n < €}. Assume that e < r. Then 

/(<?) <<? < f(€)/r and M{A, f(e))>N(A,€)>N(A, f(e)/r). 
Since \n(x) is an increasing positive function of x for x > 1, it follows that 
\ \n(N(A,f(e)/r)) \ ^ \ \n{N(A,e)) \ 

I ln(l//(«)) I ~ I ln(l/6) J (1) 

\n(M (A, f{e))) 1 

ln(r//(6)) j • (2) 

Assume that7V(A; e) oo as e 0; if not then the theorem is true. The right-hand 
side of equation 2 obeys 

lim .ln(^(A,/(0))l ^ \MMiA,e n)) 



o\ ln(r//(e)) j ln(r/e„) 

\n(M(A, €„)) 



= lim 



"^oo [ln(r) +ln(l/e w ) 



= lim 



t ln(Af(A,€ n )) } 
I ln(l/€„) J 



1 Fractal Dimension 1 75 



The left-hand side of equation 2 obeys 



\ \n(Af(A,f(€)/r)) 
ZS>\ ln(l//(0) 



= lim 



= lim 



ln(Af(A,6„-i)) 
ln(l/€„) 
f ln(Af(A,6„_i)) 
lln(l/r)+ln(l/€„_i)) 
f ln(Af(A,6„)) l 
• 1 ln(l/€„) J ' 



ln(l/€„) 

So as € ^ 0 both the left-hand side and the right-hand side of equation 2 approach 
the same value, claimed in the theorem. By the Sandwich Theorem of calculus, the 
limit as € ^ 0 of the quantity in the middle of equation 2 also exists, and it equals 
the same value. This completes the proof of the theorem. 

Theorem 1.2 The Box Counting Theorem. Let A e H(R m ), where the Eu- 
clidean metric is used. Cover R m by closed square boxes of side length (1/2"), as 
exemplified in Figure V.133 for n = 2andm=2. Let N n {A) denote the number of 
boxes of side length (1/2") which intersect the attractor. If 

f ln(X,(A)) | 
1 ln(2») )' 



D= lim 



ln(2") 

then A has fractal dimension D. 

Proof We observe that for m = 1 , 2, 3, . . ., 

2- m M n -i < M(A, 1/2") < Af m for all n = 1, 2, 3, ... , 

where k(n) is the smallest integer k satisfying k > n - 1 + \ log 2 m. The first in- 
equality holds because a ball of radius 1/2" can intersect at most 2 m "on-grid" boxes 
of side 1 /2"- 1 . The second follows from the fact that a box of side s can fit inside a 
ball of radius r provided r 2 > (f ) 2 + (|) 2 + cldots + (|) 2 = m(|) 2 by the theorem 
of Pythagoras. Now 

ln(A/" fc („)) 



lim 



ln(2") 



f ln(2 fe (">) lnCAfc w ) , _ p 
n^ot ln(2») ln(2 fc W) 



since ^ 1. Since also 



lim 

n-»-oo 



( ln2-"^-i | _ f ln^,-i 1 _ D 
1 ln(2») ) n^oo [ln(2"- 1 ) J 



ln(2") 

Theorem 1.1 with r = 1/2 completes the proof. 

There is nothing magical about using boxes of side (1/2)" in Theorem 1.2. One 
can equally well use boxes of side Cr", where C > 0 and 0 < r < 1 are fixed real 
numbers. 
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Figure V.133. Closed 
boxes of side (1/2") 
cover [R 2 . Here n = 2. See 
Theorem 1.2. 
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Figure V.134. it re- 1 

quires (l/2")~ 2 boxes 
of side (1/2") to cover 
■ Cl 2 .We deduce, with 
a feeling of relief, that the 
fractal dimension of ■ is 
2. To which collage is this 
related? 




Examples & Exercises 

1 .4. Consider the ■ c R 2 . It is easy to see that M { (■) = 4, M 2 (m) = 16, Af 3 (m) = 64, 

Af 4 (m) = 256, and in general that M n (m) = 4" for n = 1, 2, 3, . . .; see Figure V.134. ' 
Theorem 1.2 implies that 



[ ln(2") J ( in(2") J 



1.5. Consider the Sierpinski triangle A, in Figure V.135, as a subset of (IR 2 , 
Euclidean). 

We see that A/"i(A) = 3, Af 2 (A) = 9,A/" 3 (A) = 27,AT 4 (A) = 81, and in general 
JV„(A) = 3 B for /i = 1,2,3,.... 
Theorem 1.2 implies that 

D(A)=lim \ ln(K(A) )=lim »U!^ 
n-^oo [ in(2») j n^oo \ \ n (2 n ) J In (2) ' 

1.6. Use the Box Counting Theorem, but with boxes of side length (1/3)", to cal- 
culate the fractal dimension of the classical Cantor set C described in exercise 1.5 in 
Chapter III. 

1 .7. Use the Box Counting Theorem to estimate the fractal dimension of the fractal 
subset of R 2 shown in Figure V.136. You will need to take as your first box the 
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obvious one suggested by the figure. You should then find that there appears to be 
a pattern to the sequence of numbers M\ , ■ • ■• 

1.8. The same problem as 1.7, this time applied to Figure V.137. By making the 
right choice of Cartesian coordinate system, you will make this problem easy. 

What happens to the fractal dimension of a set if we deform it "with bounded 
distortion"? The following theorem tells us that metrically equivalent sets have the 
same fractal dimension. For example, the two fractals in Figure V.131 have the same 
fractal dimension! 

Theorem 1.3 Let the metric spaces (X u d x ) and (X 2 , d 2 ) be metrically equiv- 
alent. Let 0 : Xi -> X 2 be a transformation that provides the equivalence of the 
spaces. Let A x e H(X { ) have fractal dimension D. Then A 2 = 9{A X ) has fractal di- 
mension D. That is 

D(A x ) = D(0{A 2 )). 

Proof This proof makes use of the concepts of the lim sup and lim inf of a func- 
tion. (The lim sup is discussed briefly following Definition 2.1 in the next section.) 

Since the two spaces (*,,«/,) and (X 2 ,d 2 ) are equivalent under 0, there exist 
positive constants e\ and e 2 such that 

ei d 2 (e(x), 000) < ddx, y) < e 2 d 2 (0(x), 6(y)) for all x,yeX,. (3) 
Without loss of generality we assume that e x < 1 < e 2 . Equation 3 implies 

di(x,y) 



d 2 (6(x),6(y))< 



for all x, y e X\. 
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Figure V.136. What 
other well-known frac- 
tal has the same fractal 
dimension? 



1^ Lb Lb LL jb Lb. ib Lb 
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This implies 

6>(#(*,e)) C fl(6>(x), <?M)) for all xeX x . (4) 

Now, from the definition of N(Ai,e), we know that there is a set of points 
[xi,X2,---,xtf] C Xi, where M = M(A\,e), such that the set of closed balls 
{B(x n , €) :n — 1,2, .. . ,N(A\, e)} provides a cover of A i. It follows that {9(B(x n , 
€)) : n = 1, 2, . . . , A/"(Ai, e)} provides a cover of A 2 . Equation 4 now implies that 
{fl(#(x n ), €/ei)) : n = 1, 2, . . . , A/"(Ai, e)} provides a cover of A 2 . Hence 

A/'(A 2 ,e/e 1 )<A/'(A 1 ,e). 



Hence, when e < 1 , 



ln(A/"(A 2 , €M)) < ln(jV(Ai,€)) 



ln(l/€) " ln(l/€) 

It follows that 

fln(A/ -(A 2 ,Q) l 

lim sup j — — > (5) 
e ^ 0 I ln(l/€) J 

f ln(AA(A 2 ,6/ gl )) ] ^. [ lnCAT^Lc)) ] 

= hm sup { — — \ < lim { — — } = Z)(Ai). (6) 

1 ln(l/€) J ln(l/€) ] 

We now seek an inequality in the opposite direction. Equation 3 implies that 

d\{B~\x), 9~\y)) < e 2 d 2 (x, y) for all x,ye X 2 . 



This tells us that 



1 Fractal Dimension 




Figure V.137. If you 

choose the "first" box just 
right, the fractal dimen- 
sion of this fractal is easily 
estimated. Count the num- 
ber N„ of boxes of side 
1 /2 n which intersect the 
set, for n = 1, 2, 3, . . ., 
and apply the Box Count- 
ing Theorem. 



0-\B(x, €)) C B(6- l (x), e 2 €)) for all x e X 2 , 
and this in turn implies 

Af(A u e 2 e)<U(A 2 ,e). 



Hence, when e < 1 , 



It follows that 



\n(Af(A u e 2 €)) \n(Af(A 2 , e)) 



ln(l/€) 



ln(l/€) 



r [lnCA/" (Ai,€)) 
D(AO = hm — 
€-^o [ ln(l/e) 

(ln(A^(Ai,e 2 e)) 1 

= S?ol ln(l/0 1 

r . r (ln(A/" (A 2 ,€)) 
< hm inf { — — 
€-^o [ ln(l/e) 



By combining equations 6 and 7 we obtain 



ln^A 2 ,e \n(AT(A 2 , e)) 

lim inf — VtTT^ — = D < Al) = hm SU P i n n u\ 
€^o I ln(l/e) J €^o I ln(l/e) 



From this it follows that 



i . ln(Af (A 2 , c)) 1 

D(A 2 ) = lim \ \ . . = D(Ai). 
I ln(l/e) J 

This completes the proof. 



(7) 
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Examples & Exercises 

1 .9. Let C denote the classical Cantor set, living in [0, 1] and obtained by omitting 
"middle thirds." Let C denote the Cantor set obtained by starting from the closed 
interval [0, 3] and omitting "middle thirds." Use Theorem 1.3 to show that they 
have the same fractal dimension. Verify the conclusion by means of a box-counting 
argument. 

1.10. Let A be a compact nonempty subset of R 2 . Suppose that A has fractal di- 
mension D\ when evaluated using the Euclidean metric and fractal dimension Di 
when evaluated using the Manhattan metric. Show that D\ = D 2 . 

1.11. This example takes place in the metric space (!R 2 , Manhattan). Let A\ and Ai 
denote the attractors of the following two hyperbolic IFS 

{R 2 ; wi(x, y), w 2 (x, y), w 3 (x, y)} and {R 2 ; w 4 (x, y), w 5 (x, y), w 6 (x, y)}, 



By finding a suitable change of coordinates, show A\ and A 2 have the same fractal 
dimensions. 



The following definition extends Definition 1.1. It provides a value for the fractal 
dimension for a wider collection of sets. 

Definition 2.1 Let (X, d) be a complete metric space. Let A e H(X). LetM(() 
denote the minimum number of balls of radius e needed to cover A. If 



where 




and 
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exists, then D is called the fractal dimension of A. We will also use the notation 
D = D(A), and will say "A has fractal dimension D." 

In stating this definition we have "spelled out" the lim sup. For any function /(e), 
defined for 0 < e < 1, for example, we have 

lim sup /(e) = lim{sup{/(e) : e e (0, e)}}. 

It can be proved that Definition 2.1 is consistent with Definition 1.1: if a set has 
fractal dimension D according to Definition 1.1 then it has the same dimension 
according to Definition 2.1. Also, all of the theorems in this book apply with either 
definition. The broader definition provides a fractal dimension in some cases where 
the previous definition makes no assertion. 

Theorem 2.1 Let m be a positive integer; and consider the metric space (OT, 
Euclidean). The fractal dimension D(A) exists for all A e H(R m ). Let B e H(R m ) 
be such that A C B; and let D(B) denote the fractal dimension ofB. Then D(A) < 
D(B). In particular, 

0 < D(A) < m. 

Proof We prove the theorem for the case m = 2. Without loss of generality we 
can suppose that A c ■• It follows that J\f (A, e) < Af(m, e) for all e > 0. Hence for 
all e such that 0 < e < 1 we have 

ln(Af(A,e)) < ln(Afc, e)) 
°" ln(l/e) " ln(l/e) 

It follows that 

\\n(M(A,e))\ [ \n(Af(m, e)) 

lim sup -4-77777- ^ limsU P IndM " 



The lim sup on the right-hand side exists and has value 2. It follows that the lim sup 
on the left-hand side exists and is bounded above by 2. Hence the fractal dimension 
D(A) is defined and bounded above by 2. Also D(A) is nonnegative. 

If A, Be H(R 2 ) with A c B, then the fractal dimensions of A and B are defined. 
The above argument wherein ■ is replaced by B shows that D(A) < D(B). This 
completes the proof. 

The following theorem helps us to calculate the fractal dimension of the union of 
two sets. 

Theorem 2.2 Let m be a positive integer; and consider the metric space (R m , 
Euclidean). Let A and B belong to H(R m ). Let A be such that its fractal dimension 
is given by 

1 \\n(M(A,e))\ 
D(A) = lim { | ■ 

6->o I ln(l/e) ] 
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Let D(B) and D(A U B) denote the fractal dimensions ofB and A U B, respectively. 
Suppose that D(B) < D(A). Then 

D(AU B) = D(A). 

Proof From Theorem 2.1 it follows that D(A U B) > D(A). We want to show 
that D(A U B) < D(A). We begin by observing that, for all e > 0, 

Af(A U B, e) < Af(A, 6) + Af(fl, 6). 

It follows that 



f ln(Af(A U B, €)) 1 
D(AUB) = lim sup \ ' 

I ln(l/e) j 



fln(AT(A,6)+AT(fi,6)) 

< lim sup { 

- 6 _ 0 F l ln(l/6) 

(ln(AT(A,6))l \\n(l +Af(B,€)/Af(A, € ))\ 

< hm sup \ — — — — — [ + lim sup { — [ . 

6 _o I ln(l/€) ) I ln(l/6) j 

The proof is completed by showing that Af(B, e)/N(A, e) is less than 1 when e is 
sufficiently small. This would imply that the second limit on the right here is equal 
to zero. The first limit on the right converges to D(A). 
Notice that 

\\n(Af(B,€)) „ 

sup { : € < € 

I ln(l/6) 

is a decreasing function of the positive variable 6. It follows that 
\n(Af(B, e)) 

— , , , — < D(A) for all sufficiently small e > 0. 
ln(l/e) 

Because the limit explicitly stated in the theorem exists, it follows that 
\n(Af(B,€)) \n(Af(A,e)) 

— — < : — for all sufficiently small e > 0. 

ln(l/e) ln(l/e) 

This allows us to conclude that 
N(B,e) 



N(A,e) 
This completes the proof. 



< 1 for all sufficiently small e > 0. 



Examples & Exercises 

2. 1 . The fractal dimension of the hairy set A c suggested in Figure V.138, is 2. 
The contribution from the hairs to Af(A, e) becomes exponentially small compared 
to the contribution from ■, as 6 -> 0. 

We now give you a wonderful theorem that provides the fractal dimension of the 
attractor of an important class of IFS. It will allow you to estimate fractal dimensions 
"on the fly," simply from inspection of pictures of fractals, once you get used to it. 
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Figure V.138. Picture 
of a hairy box. The fractal 
dimension of the subset 
of R 2 suggested here is 
the same as the fractal 
dimension of the box. The 
hairs are overpowered. 



Theorem 2.3 Let {R m ; w u u> 2 , • • • , w N } be a hyperbolic IFS, and let A de- 
note its attractor. Suppose w n is a similitude of scaling factor s n for each n e 
{1, 2, 3, ... , N}. If the IFS is totally disconnected or just-touching then the attractor 
has fractal dimension D(A), which is given by the unique solution of 

N 

J2\s n \ D(A) = h D(A)e[0,m]. 
If the IFS is overlapping, then D > D(A), where D is the solution of 

N _ 

]T|5„| D = 1, De[0,oo). 

n=l 

Sketch Of proof The full proof can be found in [Bedford 1986], [Hardin 1985], 
[Hutchinson 1981], and [Reuter 1987]. The following argument gives a valuable 
insight into the fractal dimension. We restrict attention to the case where the IFS 
{R m ; w u w 2 , . ■ ■ , wn} is totally disconnected. We suppose that the scaling factor s, 
associated with the similitude w, is nonzero for each i e {1, 2, . . . , N}. Let e > 0. 
We begin by making two observations. 

Observation (i): Let i e {1, 2, . . . , N}. Since is a similitude of scaling factor 
si, it maps closed balls onto closed balls, according to 

Wi(B(x,€)) = Biwiix), \si\e). 
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Assume that s, ^ 0 . Then is invertible, and obtain 

w7 1 (B(x,€)) = B(w7 1 (x),\s i \- 1 €). 
The latter two relations allow us to establish that for all 6 > 0, 

Af(A,t)=Af(w i (A),\s i \t); 

which is equivalent to 

ff(w i (A),€)=ff(A,\s i \- 1 €). (1) 

This applies for each i e {1, 2, 3, ... , N}. 

Observation (ii): The attractor A of the IFS is the disjoint union 

A = wi(A) U w 2 (A) U . . . U w N (A), 

where each of the sets w n (A) is compact. Hence we can choose the positive number 
6 so small that if, for some point x e 1R 2 and some integer / e { 1 , 2, . . . , N}, we have 
B(x, e) n Wi(A) ^ 0, then B(x, e) n wj(A) = 0 for all j e {1, 2, ... , N} with j ^ /. 
It follows that if the number € is sufficiently small we have 

Af(A, 6) = M(Wi(A), 6) + M(w 2 (A), 6) + M(W 3 (A), 6) + • • • + M(W N (A), 6) 

We put our two observations together. Substitute from equation 1 into the last equa- 
tion to obtain 

Af(A, €)=Af(A, \ Sl \-^)+Af(A, \s 2 r l t) 

+ Af(A, |s 3 | -1 €) + • • -+Af(A, Is/vP 1 *?)- 

This functional equation is true for all positive numbers € that are sufficiently small. 
The proof is completed by showing formally that this implies the assertion in the 
theorem. 

Here we demonstrate the reasonableness of the last step. Let us make the assump- 
tion Af(A, €) ~ Ce~ D . Then substituting into equation 2 we obtain the equation: 

C € -° * C\s x \ D ^ D + C\s 2 \ D e~ D + C|5 3 | D 6- D + • • • + C\s N \ D e- D . 

From this we deduce that 

1 = |J1| D +|J 2 | D +|J3| D + ---+I^| D . 

This completes our sketch of the proof of Theorem 2.3. 

Examples & Exercises 

2.2. This example takes place in the metric space (1R 2 , Euclidean). A Sierpinski 
triangle is the attractor of a just-touching IFS of three similitudes, each with scaling 
factor 0.5. Hence the fractal dimension is the solution D of the equation 

(0.5) D + (0.5) D + (0.5) D = 1 
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Figure V.139. The 

Castle fractal. This is 
an example of a self- 
similar fractal, and its 
fractal dimension may 
be calculated with the 
aid of Theorem 2.3. The 
associated IFS code is 
given in Table V. 1 . 



from which we find 



D = 



ln(l/3) _ ln(3) 
ln(0.5) ~ ln(2). 



2.3. Find a just-touching IFS of similitudes in R 2 whose attractor is ■. Verify that 
Theorem 2.3 yields the correct value for the fractal dimension of ■. 

2.4. The classical Cantor set is the attractor of the hyperbolic IFS 

1 1 2, 

{[0, 1]; wi(x) = -x; wdx) = -x + -}. 

Use Theorem 2.3 to calculate its fractal dimension. 

2.5. The attractor of a just-touching hyperbolic IFS {Q& 2 ; u>,-(x), i = 1, 2, 3, 4} is 
represented in Figure V.139. The affine transformations w { :R 2 ^R 2 are simili- 
tudes and are given in tabular form in Table VI. Use Theorem 2.3 to calculate the 
fractal dimension of the attractor. 

2.6. The attractor of a just-touching hyperbolic IFS {R 2 ; Wi(x), i = 1, 2, 3} is rep- 
resented in Figure V.140. The affine transformations w t : K 2 -> R 2 are similitudes. 
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Table V. 1 . IFS code for a Castle. 



w 


a 


b 


c 


d 


e 


/ 


P 


1 


0.5 


0 


0 


0.5 


0 


0 


0.25 


2 


0.5 


0 


0 


0.5 


2 


0 


0.25 


3 


0.4 


0 


0 


0.4 


0 


1 


0.25 


4 


0.5 


0 


0 


0.5 


2 


1 


0.25 



Figure V.140. To cal- 
culate the fractal dimen- 
sion of the subset of R 2 
represented here, first ap- 
ply the Collage Theorem 
to find a corresponding set 
of similitudes. Then use 
Theorem 2.3. 



Use the Collage Theorem to find the similitudes, and then use Theorem 2.3 to calcu- 
late the fractal dimension of the attractor. 

2.7. Figure V.141 represents the attractor of an overlapping hyperbolic IFS 

{R 2 ; Wi(x),i = 1,2,3,4}. 

Use the Collage Theorem and Theorem 2.3 to obtain an upper bound to the fractal 
dimension of the attractor. 

2.8. Calculate the fractal dimension of the subset of R 2 represented by Figure V.142. 

2.9. Consider the attractor A of a totally disconnected hyperbolic IFS 

{R 7 ; Wi(x), i = l,2} 

where the two maps 

io i : R 7 -> R 7 and w 2 : R 7 -> R 7 

are similitudes, of scaling factors s\ and S2, respectively. Show that A is also the 
attractor of the totally disconnected hyperbolic IFS {R 7 ; Vi(x), i = 1, 2, 3, 4} where 
uj = w\ o w\, V2 = w i o u>2, u 3 = if 2 o w\, and v\ = w 2 o it> 2 . Show that Vi(x) is a 
similitude, and find its scaling factor, for ? = 1,2,3, 4. Now apply Theorem 2.3 to 
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Figure V.141. An up- 
per bound to the fractal 
dimension of the attractor 
of an overlapping IFS, cor- 
responding to this picture, 
can be computed with the 
aid of Theorem 2.3. 
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yield two apparently different equations for the fractal dimension of A. Prove that 
these two equations have the same solution. 



3 The Experimental Determination of the Fractal Dimension 

In this section we consider the experimental determination of the fractal dimension 
of sets in the physical world. We model them, as best we can, as subsets of (Q& 2 , 
Euclidean) or (Q& 3 , Euclidean). Then, based on the definition of the fractal dimen- 
sion, and sometimes in addition on one or another of the preceding theorems, such 
as the Box Counting Theorem, we analyze the model to provide a fractal dimension 
for the real-world set. 

In the following examples we emphasize that when the fractal dimension of a 
physical set is quoted, some indication of how it was calculated must also be pro- 
vided. There is not yet a broadly accepted unique way of associating a fractal di- 
mension with a set of experimental data. 

Example 

3.1. There is a curious cloud of dots in the woodcut in Figure V.143. Let us try to 
estimate its fractal dimension by direct appeal to Definition 1.1. 

We begin by covering the cloud of points by disks of radius e for a range of e- 
values from e = 3 cm down to e = 0.3 cm; and in each case we count the number of 



Figure V.143. Cover- 
ing a cloud of dots in a 
woodcut by balls of radius 

€ >0. 
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Table V.2. Minimal numbers of balls, of various radii, needed to cover 
a "dust" in a woodcut. 



€ 




3 cm 


I 


2 cm 


3 


1.5 cm 


4 


1.2 cm 


6 


1 cm 


7 


0.75 cm 


10 


0.5 cm 


16 


0.4 cm 


23 


0.3 cm 


31 


0.015 cm 


267 



Table V.3. The data in Table III. 1 is tabulated in log-log form. These values 
are used to obtain the fractal dimension. 



ln(l/e) \n{M{A,t)) 



-1.1 


0.69 


-0.69 


1.09 


-0.405 


1.39 


-0.182 


1.79 


0 


1.95 


0.29 


2.30 


0.693 


2.77 


0.916 


3.13 


1.204 


3.43 


4.2 


5.59 



disks needed. This provides the set of approximate values for Af(A, <?) given in Table 
III. 1. The data is redisplayed in log-log format in Table V.3. The data in Table V.3 is 
plotted in Figure V.144. A straight line that approximately passes through the points 
is drawn. The slope of this straight line is our approximation to the fractal dimension 
of the cloud of points. 

The experimental number Af(A, 0.015 cm) is not very accurate. It is a very rough 
estimate based on the size of the dots themselves and is not included in the plot in 
Figure V.144. The slope of the straight line in Figure V.144 gives 

D(A)~1.2, over the range 0.3 cm to 3 cm, (1) 

4 

where A denotes the set of points whose dimension we are approximating. 

The straight line in Figure V.144 was drawn "by eye." Thus if one was to repeat 




the experiment, a different value for D(A) may be obtained. In order to make the re- 
sults consistent from experiment to experiment, the straight line should be estimated 
by a least squares method. 

In proceeding by direct appeal to Definition 1.1, the estimates of N(A, e) need 
to be made very carefully. One needs to be quite sure that M(A, e) is indeed the 
least number of balls of radius e needed. For large sets of data this could be very 
time-consuming. 

It is clearly important to state the range of scales used: we have no idea or defi- 
nition concerning the structure of the dots in Figure V. 143 at higher resolutions than 
say 0.015 cm. Moreover, regardless of how much experimental data we have, and 
regardless of how many scales of observation are available to us, we will always end 
up estimating the slope of a straight line corresponding to a finite range of scales. If 
we include the data point (0.015 cm, 267) in the above estimation, we obtain 

D(A) ~ 0.9, over the range of scales 0.015 to 5 cm. (2) 

We comment on the difference between the estimates in equations 1 and 2. If we 
restrict ourselves to the range of scales in equation 1 , there little information present 
in the data to distinguish the cloud of points from a very irregular curve. However, 
the data used to obtain equation 2 contains values for Af(A, e) for several values of 
e such that the corresponding coverings of A are disconnected. The data is "aware" 
that A is disconnected. This lowers the experimentally determined value of D. 

3.2. In this example we consider the physical set labelled A in Figure V.145. A is 
actually an approximation to a classical Cantor set. In this case we make an exper- 
imental estimate of the fractal dimension, based on the Box Counting Theorem. A 
Cartesian coordinate system is set up as shown and we attempt to count the num- 
ber of square boxes M n (A) of side (l/2 n ) which intersect A. We are able to obtain 
fairly accurate values of Af n (A) for n = 0, 1, 2, 3, 4, 5, and 6. These values are pre- 
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Figure V.145. Succes- 
sive subdivision of over- 
laying grid to obtain the 
box counts needed for 
the application of Theo- 
rem 1.2 to estimate the 
fractal dimension of the 
Cantor set A. The counts 
are presented in Table V.4. 



0 



Table V.4. The data determined from Figure 145, in the experimental calculation 
of the fractal dimension of the physical set A. 



n 




In Af n (A) 


n In 2 


0 


1 


0 


0 


1 


3 


1.10 


0.69 


2 


7 


1.95 


1.38 


3 


10 


2.30 


2.08 


4 


19 


2.94 


2.77 


5 


33 


3.50 


3.46 


6 


58 


4.06 


4.16 



sented in Table V.4. We note that these values depend on the choice of coordinate 
system. Nonetheless the values of Af n (A) are much easier to measure than the values 
of N(A, e) used in example 3.1. 

The analysis of the data proceeds just as in example 3.1. It is represented in 
Table V.4 and Figure V.146. We obtain 

1 

D(A) ~ 0.8, over the range - inch to 8 inches. 

8 

3.3. In this example we show how a good experimentalist [Strahle 1987] overcomes 
the inherent difficulties with the experimental determination of fractal dimensions. 
In so doing he obtains a major scientific result. The idea is to compare two sets of 
experimental data, obtained by different means, on the same physical system. The 




physical system is a laboratory jet flame. The data are time series for the temperature 
and velocity at two different points in the jet. The idea is to apply the same procedure 
to the analysis of the two sets of data to obtain a value for the fractal dimension. 
The two values are the same. Instead of drawing the conclusion that the two sets of 
data "have the same fractal dimension," he deduces that the two sets of data have a 
common source. That common source is physical, real-world chaos. 

The experimental setup is as follows. A flame is probed by (a) a laser beam and 
(b) a very thin wire. These two probes, coupled with appropriate measuring devices, 
allow measurements to be made of the temperature and velocity in the jet, at two 
different points, as a function of time. In (a) the light bounces off the fast-moving 
molecules in the exhaust, and a receiver measures the characteristics of the bounced 
light. The output from the receiver is a voltage. This voltage, suitably rescaled, gives 
the temperature of the jet as a function of time. In (b) a constant temperaiure is 
maintained through a wire in the flame. The voltage required to hold the temperature 
constant is recorded. This voltage, suitably rescaled, gives the velocity of the jet as 
a function of time. In this way we obtain two independent readings of two different, 
but related, quantities. 

Of course the experimental apparatus is much more sophisticated than it sounds 
from the above description. What is important is that the measuring devices are of 
very high resolution, accuracy, and sensitivity. A reading of the velocity can be made 
once every microsecond. In this example the temperature was read every 0.5 x 10~ 4 
sec. Vast amounts of data can be obtained. A sample of the experimental output from 
(a) is shown in Figure V.147, where it is represented as the graph of voltage against 
time. It is a very complex curve. If one "magnifies up" the curve, one finds that its 
geometrical complexity in the curve continues to be present. It is just the sort of thing 
we fractal geometers like to analyze. 

A sample of the experimental output from (b) is shown in Figure V.148, again 
represented as a graph of voltage against time. You should compare Figures V.147 
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ROV LEIGH SCATTERING VOLTAGE 




Figure V.147. Graph 
of voltage as a function of 
time from an experimental 
probe of a turbulent jet. 
In this case the probe 
measures scattering of a 
laser beam by the flame. 



HOI FILM VOLTAGE 



u v v vv v u 

* Expanded Area 
» 
» 




Figure V.148. Graph 
of voltage as a function of 
time from an experimental 
probe of a turbulent jet. In 
this case the probe mea- 
sures the voltage across 
a wire in the flame. This 
data has a definite fractal 
character, as demonstrated 
by the expanded piece 
shown in Figure V.149. 



and V.148. They look different. Is there a relationship between them? There should 
be: they both probe the same burning gas and they are in the same units. 

In order to bring out the fractal character in the data, an expanded piece of the 
data in Figure V. 148 is shown in Figure V. 149. 

The fractal dimensions of the graphs of the two time series, obtained from (a) 
and (b), is calculated using a method based on the Box Counting Theorem. Exactly 
the same method is applied to both sets of data, over the same range of scales. Fig- 
ure V. 150 shows the graphical analysis of the resulting box counts. Both experiments 
yield the same value 

D % 1.5, over the range of scales 2 6 x 1(T 5 sec to 2 13 x 1(T 5 sec. 

This suggests that, despite the different appearances of their graphs, there is a com- 
mon source for the da^a. 

We believe that this common source is chaotic dynamics of a certain special flavor 
and character, present in the jet flame. If so then fractal dimension provides an 
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experimentally measurable parameter that can be used to characterize the brand of 
choas. 

3.4. Use a method based on the direct application of Definition 1.1 to make an 
experimental determination of the fractal dimension of the physical set defined by 
the black ink in Figure V.151. Give the range of scales to which your result applies. 

3.5. Use a method based on the Box Counting Theorem, as in Example 3.2, to es- 
timate the fractal dimension of the "random dendrite" given in Figure V.152. State 
the range of scales over which your estimate applies. Make several complete experi- 
ments to obtain some idea of the accuracy of your result. 

3.6. Make an experimental estimate of the fractal dimension of the dendrite shown 
in Figure V.153. Note that a grid of boxes of size (l/12)th inch by (l/12)th inch has 
been printed on top of the dendrite. Compare the result you obtain with the result 
of exercise 3.5. It is important that you follow exactly the same procedure in both 
experiments. 

3.7. Make an experimental determination of the fractal dimension of the set in 
Figure V.142. Compare your result with a theoretical estimate based on Theorem 2.3, 
as in exercise 2.7. 

3.8. Obtain maps of Great Britain of various sizes. Make an experimental determi- 
nation of the fractal dimension of the coastline, over as wide a range of scales as 
possible. 

3.9. Obtain data showing the variations of a Stock Market index, at several different 
time scales, for example, hourly, daily, monthly, and yearly. Make an experimental 
determination of the fractal dimension. Find a second economic indicator for the 
same system and analyze its fractal dimension. Compare the results. 



Figure V.149. A EXPANDED AREA HOI FILM VOLTAGE 

blowup of a piece of the 
graph in Figure V.148. 
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Kolmogorov line : -1.7 
Integral line : 5,4 
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Figure V.150. Graphi- 
cal analysis of box counts 
associated with experi- 
ments (a) and (b). The 
data analyzed is illus- 
trated in Figures V.145 
and V.148. The two data 
streams, which come from 
probes of a single turbu- 
lent system, when ana- 
lyzed in exactly the same 
way, yield the same value 
D = 1 .5 for the fractal di- 
mension. This suggests 
that, despite the differ- 
ent appearances of their 
graphs, there is a com- 
mon source for the data. 
This source is chaotic dy- 
namics of a certain special 
flavor and character. The 
fractal dimension provides 
a measurable symptom of 
the brand of chaos. 



4 The Hausdorff-Besicovitch Fractal Dimension 

The Hausdorff-Besicovitch dimension of bounded subset of R m is another real num- 
ber that can be used to characterize the geometrical complexity of bounded subsets 
of R m . Its definition is more complex and subtle than that of the fractal dimension. 
One of the reasons for its importance is that it is associated with a method for com- 
paring the "sizes" of sets whose fractal dimensions are the same. It is harder to work 
with than the fractal dimension, and its definition is not usually used as the basis of 
experimental procedures for the determination of fractal dimensions of physical sets. 

Throughout we work in the metric space (R m , d) where m is a positive integer and 
d denotes the Euclidean metric. Let A c R m be bounded. Then we use the notation 

diam(A) = sup{d(x, y) : x, y e A}. 

•* 

Let 0 < 6 < oo, and 0 < p < oo. Let A denote the set of sequences of subsets 
{A, c A}, such that A = U^A,. Then we define 
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Figure V.151. Make 
an experimental estimate 
of the fractal dimension of 
the set A of black ink, 
above, over the range 
of scales 5 inches to 
0.1 inches. Base your 
experimental method 
directly on Definition 1.1. 
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Figure V. 152. Make an experimental estimate of the fractal dimension of this set in (a), 
over the range of scales 5 inches to (1/1 2)th inch, basing your method on the Box Counting 
Theorem and graphical analysis. In order to help you with your work, in (b) we have overlaid 
the set a grid of boxes (1/1 2)th inch by (l/12)th inch. 



Figure V.153. Make 
an experimental estimate 
of the fractal dimension 
of the random dendrite 
shown here. Note that 
a grid of boxes of size 
(l/12)th inch by (l/12)th 
inch has been printed 
on top of the dendrite. 
Compare the experimental 
fractal dimension here 
with that of the dendrite in 
Figure V.152. In advance, 
which one do you expect 
will have lower fractal 
dimension? 
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M(A,p,€) = 

!oo 
^(diam(A I )K : {A,-} e A, and diam(A,) < e for / = 1, 2, 3, . . . 




Here we use the convention that (diam(A,))° = 0 when A,- is empty. M(A, p, e) is 
a number in the range [0, oo]; its value may be zero, finite, or infinite. You should 
verify that it is a nonincreasing function of e. We now define 



Then for each p e [0, oo] we have M(A, p) e [0, oo]. 

Definition 4. 1 Let m be a positive integer and let Abe a bounded subset of the 
metric space {R m , Euclidean). For each p € [0, oo) the quantity M(A, p) described 
above is called the Hausdorff /^-dimensional measure of A. 

Examples & Exercises 

4.1 . Show that M. (A, p) is a nonincreasing function of p e [0, oo]. 

4.2. Let A denote a set of seven distinct points in (0& 2 , Euclidean). Show that 
M(A, 0) = 7 and M(A, p) = 0 for p > 0. 

4.3. Let A denote a countable infinite set of distinct points in (R 2 , Euclidean). 
Show that M(A, 0) = oo and M(A, p) = 0 for p > 0 

4.4. Let C denote the classical Cantor set in [0,1]. Show that M(C, 0) = oo and 
M(C, 1) = 0. 

4.5. Let A denote a convenient Sierpinski triangle. Show that 1) = oo and 
M(A, 2) = 0. Can you evaluate 7W(A, ln(3)/ ln(2))? At least try to argue why this 
might be an interesting number. 

The Hausdorff /^-dimensional measure M(A, p), as a function of p e [0, oo], 
behaves in a remarkable manner. Its range consists of only one, two, or three values! 
The possible values are zero, a finite number, and infinity. In Figure V.154 we 
illustrate this behavior when A is a certain Sierpinski triangle. 

Theorem 4. 1 Let m be a positive integer. Let Abe a bounded subset of the met- 
ric space (R m , Euclidean). Let Ai(A, p) denote the function of p G [0, oo) defined 
above. Then there is a unique real number D H e [0, m] such that 



Proof This can be found, for example, in [Federer 1969], section 2.10.3. 

Definition 4.2 Let m be a positive integer and let A be a bounded subset of 
the metric space (R m , Euclidean). The corresponding real number D H , occurring 
in Theorem 4.1, is called the Hausdorff-Besicovitch dimension of the set A. This 
number will also be denoted by D H (A). 



M(A, p) = supLW(A, p, €):€> 0}. 



M(A,p) = 



oo 



0 



if p < D H and p e [0, oo), 
if p > D H and p e [0, oo). 
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INFINITY 



M(p) 




Figure V.154. Graph 
the function .M(A, p) 
when A is a certain Sier- 
pinski triangle. It takes 
only three values. 



Theorem 4.2 Let m be a positive integer and let A be a subset of the metric 
space (R m , Euclidean). Let D(A) denote the fractal dimension of A and let D H (A) 
denote the Hausdorff-Besicovitch dimension of A. Then 



0 < D H (A) < D(A) < m. 



Examples & Exercises 

4.6. Describe a situation where you would expect D H (A) < D(A). 

4.7. Prove Theorem 4.2. 

Theorem 4.3 Let m be a positive integer. Let {R m ; w u w 2 , ■ ■ ■ , w N } he a hy- 
perbolic IFS, and let A denote its attractor. Let w„ be a similitude of scaling factor 
s n for each n e {1,2,3, ... , N}. If the IFS is totally disconnected or just-touching, 
then the Hausdorff-Besicovitch dimension D H (A) and the fractal dimension D(A) 
are equal. In fact D(A) = D H (A) = D, where D is the unique solution of 

N 

J2\s n \ D = U De[0,m]. 

If D is positive, then the Hausdorff D-dimensional measure M(A, D H (A)) is a 
positive real number, i 

Proof This can be found in [Hutchinson 1981]. 
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In the situation referred to in Theorem 4.3 the Hausdorff D//(A)-dimensional 
measure can be used to compare the "sizes" of fractals that have the same frac- 
tional dimension. The larger the value of M(A, D H (A)), the "larger" the fractal. 
Of course, if two fractals have different fractal dimensions, then we say that the one 
with the higher fractal dimension is the "larger" one. 



Examples & Exercises 

4.8. Here we provide some intuition about the functions A4(A, p, e) and A4(A, p), 
and the "sizes" of fractals. We illustrate how these quantities can be estimated. The 
type of procedure we use can often be followed for attractors of just-touching and 
totally disconnected IFS whose maps are all similitudes and should lead to correct 
values. Formal justification is tedious and follows the lines suggested in [Hutchinson 
1981]. 

Consider the Sierpinski triangle A with vertices at (0,0), (0, 1), and (1,0). 
We work in R 2 with the Euclidean metric. We begin by estimating the number 
M(A, p, e) for p e [0, 1] for various values of e. The values of e we consider are 
e = y/2(l/2) n for n = 0, 1, 2, 3, . . .. Now notice that A can be covered very effi- 
ciently by 3" closed disks of radius \/2(l/2)". We guess that this covering is one for 
which the infinum in the definition of M(A, p, e = \/2(l/2)") is actually achieved. 
We obtain the estimate 

A4(A, p, V2(l/2) n ) = 3 n (V2) p (l/2) np for n = 1, 2, 3, ... . 

The supremum in the definition of A4(A, p) can be replaced by a limit; so we 
obtain 

M(A,p)= lim {3 n (V2) p (l/2) np } 

n-*oo 

oo if p < ln(3)/ln(2), 

(v /2) ln (2)/in(3) [{p = ln ( 3 )/ ln ( 2 ), 

0 if p > ln(3)/ln(2). 

This tells us that D H (A) = ln(3)/ln(2), which we already know from Theo- 
rem 4.3. It also tells us that M(A, D H (A)) = ( > /2)in(2)/in(3)_ This is our est i mate 
of the "size" of the particular Sierpinski triangle under consideration. 

If one repeats the above steps for the Sierpinski triangle A with vertices at 
(0, 0), (0, 1/V2), and (1/V2, 0), one finds M(A, D H (A)) = 1. Thus A is "small- 
er" than A. Similar estimates can be made for pairs of attractors of totally discon- 
nected or just-touching IFS whose maps are similitudes and whose fractal dimen- 
sions are equal. The comparison of "sizes" becomes exciting when the two attractors 
are not metrically equivalent. 

4.9. Estimate the "sizes" of the two fractals represented in Figure V. 155. Which one 
is "largest"? Does the computed estimate agree with your subjective feeling about 
which one is largest? 
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Figure V.155. The 

two images here rep- 

^J* resent the attractors of 

»i •« *i two different IFS of the 

- * " t**S form {1R 2 ; W\, W2, wj,}, 

'"J* £i* V „ where all of the maps 

:» <r» £' are similitudes of seal- 

s' ►„ ing factor 0.4. Both sets 

v ^»»* " have the same fractal di- 

«; °* mension ln(3)/ ln(2.5). 

So which one is the 

^»-» "J.** **»*. "largest"? Compare their 

V ** .* v " "sizes" by estimating their 

-\f " * Hausdorff ln(3)/ ln(2.5)- 

*2 dimensional measures. 



4.10. Prove that the Hausdorff-Besicovitch dimension of two metrically equivalent 
bounded subsets of (R m , Euclidean) is the same. 

4.11. Let d denote a metric on which is equivalent to the Euclidean metric. 
Let A denote a bounded subset of K 2 . Suppose that d is used in place of the Eu- 
clidean metric to calculate a "Hausdorff-Besicovitch" dimension of A, denoted by 
D H (A). Prove that D H (A) = D H (A). Show, however, that the "size" of the set, 
M(A, Dh(A)), may be different when computed using d in place of the Euclidean 
metric. 

4.12. If distance in fe 2 is measured in inches, and a subset A of K 2 has fractal 
dimension 1.391, what are the units of M{A, 1.391)? 
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Figure V.156. Why 

are the fractal dimen- 
sion and the Hausdorff- 
Besicovitch dimension of 
the attractor of the IFS 
represented by this image 
equal? 




4.13. The image in Figure V.156 represents the attractor A of a certain hyperbolic 
IFS. 

(1) Explain, with support from appropriate theorems, why the fractal dimension 
D and the Hausdorff-Besicovitch dimension D H of the attractor of the IFS 
are equal. 

(2) Evaluate D. 

(3) Using inches as the unit, compare the Hausdorff-Besicovitch D-dimensional 
measures of A and w(A), where w{A) denotes one of the small "first- 
generation" copies of A. 

4.14. By any means you like, estimate the Hausdorff-Besicovitch dimension of the 
coastline of Baron von Koch's Island, shown in Figure V.157. It is recommended 
that theoreticians try to make an experimental estimate, and that experimentalists try 
to make a theoretical estimate. 

4.15. Does the work of some artists have a characteristic fractal dimension? Make 
a comparison of the empirical fractal dimensions of Romeo and Juliet, over an 
appropriate range of scales; see Figure V.158. 
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Figure V.157. By any 

means you like, estimate 
the Hausdorff-Besicovitch 
dimension of the coastline 
of Baron von Koch's 
Island. 



Chapter VI 

Fractal Interpolation 



1 



Introduction: Applications for Fractal Functions 



Euclidean geometry, trigonometry, and calculus have taught us to think about mod- 
elling the shapes we see in the real world in terms of straight lines, circles parabolas, 
and other simple curves. Consequences of this way of thinking are abundant in our 
everyday lives. They include the design of household objects; the common usage of 
drafting tables, straight-edges, and compasses; and the "applications" that accom- 
pany introductory calculus courses. We note in particular the provision of functions 
for drawing points, lines, polygons, and circles in computer graphics software such 
as MacPaint and Turbobasic. Most computer graphics hardware is designed specifi- 
cally to provide rapid computation and display of classical geometrical shapes. 

Euclidean geometry and elementary functions, such as sine, cosine, and poly- 
nomials, are the basis of the traditional method for analyzing experimental data. 
Consider an experiment that measures values of a real-valued function F(x) as a 
function of a real variable x. For example, F(x) may denote a voltage as a unction 
of time as in the experiments on the jet-engine exhaust described in Example 3.3 in 
Chapter V The experiment may be a numerical experiment on a computer. In any 
case the result of the experiment will be a collection of data of the form: 

{{XhF . ):i= 0, 1,2,..., AO- 
Here N is a positive integer, F, = F(* f ), and the *,-'s are real numbers such that 



Xq < Xi < X 2 < *3 < 



< Xft . 



The traditional method for analyzing this data begins by representing it graphi- 
cally as a subset of R 2 . That is, the data points are plotted on graph paper. Next the 
graphical data is analyzed geometrically. For example, one may seek a straight line 
"egment that is a good approximation to the graph of the data. Or else, one might 
construct a polynomial of as low degree as possible, whose graph is a good fit to 
the data over the interval [* 0 , x N ]. In place of a polynomial, a linear combination of 
elementary functions might be used. The goal is always the same: to represent the 
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Figure VI. 159. Illus- 
tration of the process 
whereby experimental 
data is represented graph- 
ically and modelled geo- 
metrically by means of a 
classical geometrical en- 
tity, such as a straight line 
or a polynomial fit to the 
data. 




data, viewed as a subset of R 2 , by a classical geometrical entity. This entity is repre- 
sented by a simple formula, one that can be communicated easily to someone else. 
The process is illustrated in Figure VI. 159. 

Elementary functions, such as trigonometric functions and rational functions, 
have their roots in Euclidean geometry. They share the feature that when their graphs 
are "magnified" sufficiently, locally they "look like" straight lines. That is, the tan- 
gent line approximation can be used effectively in the vicinity of most points. More- 
over, the fractal dimension of the graphs of these functions is always 1 . These el- 
ementary "Euclidean" functions are useful not only because of their geometrical 
content but because they can be expressed by simple formulas. We can use them 
to pass information easily from one person to another. They provide a common lan- 
guage for our scientific work. Moreover, elementary functions are used extensively 
in scientific computation, computer-aided design, and data analysis because they can 
be stored in small files and computed by fast algorithms. 

Graphics systems founded on traditional geometry are effective for making pic- 
tures of man-made objects, such as bricks, wheels, roads, buildings, and cogs. This 
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is not suprising, since these objects were designed in the first place using Euclidean 
geometry. However, it is desirable for graphics systems to be able to deal with a 

wider range of problems. 

In this chapter we introduce fractal interpolation functions. The graphs of these 
functions can be used to approximate image components such as the profiles of 
mountain ranges, the tops of clouds, stalactite- hung roofs of caves, and honzons 
over forests, as illustrated in Figure VI.160. Rather than treating the image compo- 
nent as arising from a random assemblage of objects, such as individual mountains, 
cloudlets, stalactites, or tree tops, one models the image component as an interre- 
lated single system. Such components are not well described by elementary func- 
tions or Euclidean graphics functions. 

Fractal interpolation functions also provide a new means for fitting experimen- 
tal data Clearly it does not suffice to make a polynomial "least-squares" fit to the 
wild experimental data of Strahle for the temperature in a jet exhaust as a function 
of time, as illustrated in Figure V.147. Nor would classical geometry be a good tool 
for the analysis of voltages at a point in the human brain as read by an electoren- 
cephalograph. However, fractal interpolation functions can be used to "fit such ex- 
perimental data: that is, the graph of the fractal interpolation function can be made 
close in the Hausdorff metric, to the data. Moreover, one can ensure that the fractal 
dimension of the graph of the fractal interpolation function agrees with that of the 
data over an appropriate range of scales. This idea is illustrated in Figure VI. 161 

Fractal interpolation functions share with elementary functions that they are of a 
geometrical character, that they can be represented succinctly by "formulas," and 
that they can be computed rapidly. The main difference is their fractal character. 
For example, they can have a noninteger fractal dimension. They are easy to work 
with- once one is accustomed to working with sets rather than points and with 
IFS theory using affine maps. If we start to pass them from one to another, fractal 
functions will become part of the common language of science. So read on! 



Examples & Exercises 

1 1 Write an essay on the influences of Euclidean geometry on the way in which 
we view the physical world. How does fractal geometry change that view? 

1 2 Find the linear approximation /(*) to the function /(*) = sin(*), about the 
point x = 0 Let e > 0. Find the linear change of coordinates (*', /) = 8(x, y) in 
R\ such that OHO, 6] x [0, 6]) = [0, 1] x [0, 1]. Let /'(*') denote the function / (*) 
represented in the new coordinate system. Let /'(*') denote the function /(*) in 
the new coordinate system. Let L denote the graph of /'(*') for x' € [0, 1] and let G 
denote the graph of f{x') for x' e [0, 1]. How small must e be chosen to ensure that 
the Hausdorff distance from L to G is less than 0.01? The Hausdorff distance should 
be computed with respect to the Manhattan metric in R 2 . 
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Figure VI. 160. The 

fractal interpolation func- 
tions introduced in this 
chapter may be used in 
computer graphics soft- 
ware packages to provide 
a simple means for render- 
ing profiles of mountain 
ranges, the tops of clouds, 
and horizons over forests. 




2 Fractal Interpolation Functions 



Definition 2.1 A set of data is a set of points of the form F f ) e R 2 : i = 
0, 1, 2, ... , N}, where 

Xo < Xi < X2 < *3 < • ■ • < Xfj. 

An interpolation function corresponding to this set of data is a continuous function 
f : [xq, xyv] — > t& such that 
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Figure VI. 161. This 
figure illustrates the idea 
of using a fractal inter- 
polation function to fit 
experimental data. The 
graph of the interpolation 
function may be close, in 
the Hausdorff metric, to 
the graph of the experi- 
mental data. The fractal 
dimension of the interpo- 
lation function may agree 
with that of the data over 
an appropriate range of 
scales. 



f( Xi ) = Fi for/ = 1,2, N. 

The points (x f , F, ) e R 2 are called the interpolation points. We say that the function 
f interpolates the data and that (the graph of) f passes through the interpolation 
points. 

Examples & Exercises 

2.1 . The function f(x) = 1 + x is an interpolation function for the set of data 

{(0, 1), (1,2)}. 

Consider the hyperbolic IFS {R 2 ; w\, W2}, where 

-CM? £)(;)♦(£)• 
«(;)-(?£)(;)♦(?)■ 

Let G denote the attractor of the IFS. Then it is readily verified that G is the straight 
line segment that connects the pair of points (0, 1) and (1, 2). In other words, G is 
the graph of the interpolation function f(x) over the interval [0,1]. 

2.2. Let {(*,-, Fi) : i — 4 0, 1,2, ...,N} denote a set of data. Let / : [x 0 , x N ] R 
denote the unique continuous function that passes through the interpolation points 
and is linear on each of the subintervals [.*,■_ 1, *,■]. That is, 



THE EXPERIMENTAL DATA 
AND THE FRACTAL FUNCTION 




THE GRAPH OF A FRACTAL 
INTERPOLATION FUNCTION 
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Figure VI. 162. Graph 
of the piecewise linear 
interpolation function 
fix) through the inter- 
polation points {(Fi, xi) : 
i = 0, 1,2,3,4}. This 
graph is also the attrac- 
tor of an IFS of the form 
{[R 2 ; w n ,n = 1,2,3,4}, 
where the maps are affine. 



(Vo>- 



Lx 2 .f 2 ) 



X l X 2 



(X 4 .F 4 ) 



IX3.F3) 



fix) = Fi-i + — ^——-(Fi - Fi-i) for x e *,■], i = 1, 2, . . . , N. 

(Xj — Xi-i) 

The function f(x) is called a piecewise linear interpolation function. The graph of 
fix) is illustrated in Figure VI. 162. This graph, G, is also the attractor of an IFS of 
the form {R 2 ; w n , n = 1 , 2, . . . , N}, where the maps are affine. In fact, 

»■(;)=(:: 3(;) + G)- 

where 

ix n — X n -\) _ jx^x n -\ — XpX n ) 

ix N - xo) n (*n - xo) 

(F.-F^i) t C» W F-i-^F.) | forw = 1 , 2 w . 

ix N - x 0 ) ix N — Xo) 

Notice that the IFS may not be hyperbolic with respect to the Euclidean metric in 
M 2 . Can you prove that, nonetheless, G is the unique nonempty compact subset of 
M 2 such that 

G = U% =l w n iG)l 

2.3. Verify the claims in exercise 2.2 in the case of the data set {(0, 0), (1, 3), (2, 0)} 
by applying either the Deterministic Algorithm, Program 1, Chapter III, or the Ran- 
dom Iteration Algorithm, Program 2 of Chapter III. You will need to modify the 
programs slightly. 

2.4. The parabola defined by fix) = 2x - x 2 on the interval [0, 2] is an interpo- 
lation function for the set of data {(0, 0), (1, 1), (2, 0)} . Let G denote the graph of 
fix). That is 

G = {(x, 2x -x 2 ):xe [0, 2]}. 
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Then we claim that G is the attractor of the hyperbolic IFS {R 2 ; w u w 2 }, where 

W2 



CM", .yen:)- 



We verify this claim directly. We simply note that for all x e [0, 2], 

i „ \ / i 

/ * \ I 

Wi 



(fix)) ( 2 (^)-(^) 2 ) (/(£*)) ' 
( x \ ( \ + \x \ / 1 + ^ \ 

\f(x)J \ 2 {i + \ X ) - a + \x) 2 J V/a + i*)/ 



I X \ I It ^ X / 1 -L I 

W2 

\JKX)/ \zu + 2 *, i 2 - 

As x varies over [0, 2], the right-hand side of the first equation yields the part of the 
graph of f{x) lying over the interval [0, 1], while the right-hand side of the second 
equation yields the part of the graph of f(x) lying over the interval [1,2]. Hence 
G = wi(G) U w 2 (G). Since G e H(R 2 ) we conclude that it is the attractor of the 
IFS. Notice that the IFS is just-touching. 

2.5. Find a hyperbolic IFS of the form {R 2 ; w\, w 2 }, where w\ and w 2 are affine 
transformations in R 2 , whose attractor is the graph of the quadratic function that 
interpolates the data {(0, 0), (1, 1), (2, 4)}. 

Let a set of data {(*,-, F { ) : i = 0, 1, 2, . . . , N} be given. We explain how one can 
construct an IFS in R 2 such that its attractor, which we denote by G, is the graph 
of a continuous function / : [x 0 , x N ] -> R, which interpolates the data. Throughout 
we will restrict our attention to affine transformations. The usage of more general 
transformations is discussed in [Barnsley 1988f]. 

We consider an IFS of the form {R 2 ; w n , n = 1, 2, . . . , N}, where the maps are 
affine transformations of the special structure 

•■CM: :)(;)♦(!)■ 

The transformations are constrained by the data according to 

The situation is summarized in Figure VI. 163. 

Letn€{l,2,3,...,Af}. The transformation w n is specified by the five real num- 
bers a n ,b n ,c n ,d n , and e n , which must obey the four linear equations 

dn x 0 + e n = *n-l, 
^■n^N "i~ &n = %n > 
C n X 0 + d n F 0 + f n = F n -\, 

1 c n x N + d n F N + f n = F n . 
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Figure VI. 163. Two il- 
lustrations showing how 
an IFS of shear transfor- 
mations is used to con- 
struct a fractal interpola- 
tion function. (Produced 
by Peter Massopust.) 



(X 0 ' F 0> 




(X 4 .F 4 ) 



X 3 X 4 



K 




It follows that there is effectively one free parameter in each transformation. We 
choose this parameter to be d n for the following reason. The transformation w n is 
a shear transformation: it maps lines parallel to the y-axis into lines parallel to the 
y-axis. Let L denote a line segment parallel to the y-axis. Then w n {L) is also a line 
segment parallel to the y-axis. The ratio of the length of w n (L) to the length of L 
is |</„|. We call d n the vertical scaling factor in the transformation w n . By choosing 
d n to be the free parameter, we are able to specify the vertical scaling produced by 
the transformation. With d n = 0, n = 1, 2, . . . , N, one recovers the piecewise linear 
interpolation function. In section 6.2 we will show that these parameters determine 
the fractal dimension of the attractor of the IFS. 

Let d n be any real number. We demonstrate that we can always solve the above 
equations for a n ,c n ,e n , and /„ in terms of the data and d n . We find 
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(x, 



71 



— *n-l) 



(1) 



(x N - x 0 ) 

(x^Xn-i — XqX h ) 



(2) 



fn = 



(x N — x 0 ) 
(F n - F„_i) d n (F N - Fp) 

(x N - x 0 ) (x N - x 0 ) 
(x N F n -i — xqF„) (xnFq — x 0 F n ) 



(3) 



(4) 



(X N - X 0 ) U (X N — X 0 ) 



Now let {[R 2 ; w n , n = 1,2, . . . , N} denote the IFS defined above. Let the vertical 
scaling factor d n obey 0 < d n < 1 for n = 1, 2, . . . , N. Even with this condition, the 
IFS is not in general hyperbolic on the metric space (R 2 , Euclidean). Despite this, let 
us see what happens if we apply the Random Iteration Algorithm to the IFS. 

Here we present Program 2 in Chapter III modified so that the input data consists 
of the interpolation points and vertical scaling factors. It is written for N = 3 and the 
data set 



The vertical scaling factors are input by the user during execution of the code. The 
program calculates the coefficients in the shear transformations from the data, and 
then applies the Random Iteration Algorithm to the resulting IFS. The program is 
written in BASIC. It runs without modification on an IBM PC with enhanced graph- 
ics adaptor and Turbobasic. On any line the words preceded by a ' are comments: 
they are not part of the program. 

Program 1 . 

x[0] =0 : x[l] =30 : x[2] =60 : x[3] =100 ' Data set 
F[0] =0 : F[l] =50 : F[2] =40 : F[3] =10 
'Vertical Scaling Factors 

input "enter scaling factors d(l) , d(2) , and d(3)", 
d(l) ,d(2) ,d(3) 

'Calculate the shear transformations from the Data 
'and Vertical Scaling Factors 
for n =1 to 3 



{(0, 0), (30, 50), (60, 40), (100, 10)}. 



b = x[3]-x[0] :' 1 a[n] = (x[n] -x[n-l] )/b 
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e[n] = (x[3]*x[n-l]-x[0]*x[n])/b 

c[n] = (F[n]-F[n-l]-d[n]*(F[3]-F[0]))/b 

f f [n] = (x [3] *F [n- 1] -x [0] *F [n] -d [n] * (x [3] *F [0] -x [0] *F [3] ) ) /b 
next 

screen 2 : els ' initalialize graphics 

window (0,0) -(100, 100) 'change this to zoom and/or pan 

x =0; y =0 'initial point from which the random 
'iteration begins 

for n =1 to 1000 'Random Iteration Algorithm 

k =int(3*rnd-0.0001)+l 

newx = a[k]*x + e[k] 

newy = c[k]*x + d[k]*y + f f [k] 

x =newx : y =newy 

pset(x,y) 'plot the most recently computed point on the screen 

next 

end 

The result of running an adaptation of this program on a Masscomp workstation 
and then printing the contents of the graphics screen is presented in Figure VI. 164. 
In this case d x = 0.5, d 2 = -0.5, and d 3 = 0.23. Notice that if the size of the plotting 
window is decreased, for example by replacing the window call by WINDOW (0,0)- 
(50,50), then a portion of the image is plotted at a higher resolution. The number of 
iterations can be increased to impove the quality of the computed image. 

Examples & Exercises 

2.6. Rewrite Program 1 in a form suitable for your own computer environment, then 
run it and obtain hardcopy of the output. 

2.7. Vary the data used by Program 1. Verify, by means of computergraphical exper- 
iments, that the corresponding IFS always seems to have a unique attractor, provided 
that the vertical scaling factors are less than 1 in norm. Verify that, provided suffi- 
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Figure VI. 164. There- 
suit of running Program 1 
with vertical scaling fac- 
tors 0.5, -0.5, and 0.23. 
It appears that the corre- 
sponding IFS possesses 
a unique attractor that 
is the graph of a func- 
tion that passes through 
the interpolation points 
{(0,0), (30, 50), (60, 40), 
(100, 10)}. Is there a met- 
ric such that the IFS is 
hyperbolic? 



ciently many points are plotted, the attractor always contains the data points, and 
that it looks like the graph of a function. 

2.8. Show that the shear transformations w n , descibed above, need not be contrac- 
tions in the Euclidean metric, even though the magnitudes of the vertical scaling 
factors are less than 1. Once you have found such an example, use Program 1, suit- 
ably modified, to obtain graphical evidence concerning the possible existence of an 
attractor. You are supposed to discover that even though the IFS is not hyperbolic in 
the Euclidean metric, it appears to possess an attractor. 

2.9. Use Program 1 to verify that the attractor of the IFS in exercise 2.4 is a 
parabola. 

We now give the theoretical basis for our experimental observations. 

Theorem 2.1 Let N be a positive integer greater than 1. Let [R 2 ;w„,n = 
1,2, ...,N} denote the IFS defined above, associated with the data set 

{(x n ,F n ):n = l,2,...,N}. 

Let the vertical scaling factor d n obey 0 < d n < 1 for n = 1, 2, . . . , N. Then there is 
a metric donR 2 , equivalent to the Euclidean metric, such that the IFS is hyperbolic 
with respect tod. In particular, there is a unique nonempty compact set Gel 2 such 
that 

G = \J N n=x w n {G). 

Proof We defirie a metric d on R 2 by 

d((x u yi), (x 2 , yi)) = l*i - *i\ + - yil 
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where 0 is a positive real number, which we specify below. We leave it as an exercise 
to the reader to prove that this metric is equivalent to the Euclidean metric on IR 2 . 
Let n e {1,2, . . . , N} . Let the numbers a n ,c n ,e n ,f n , be defined by equations 1, 2, 
3, and 4. Then we have 

d(w n (x u yi), w n (x 2 , y 2 )) 

= d((a n xi + e n , c n xi + d n y x + /„), (a n x 2 + e n , c n x 2 + d n y 2 + /„)) 
= a n \xi -x 2 \ +6\c n (xi - x 2 ) +d n (yi - y 2 )\ 
< (\a n \ + 9\c n \)\x\ - x 2 \ +6\d n \\y x - y 2 \. 

Now notice that \a n \ = \x n — x n -i\/\x N - x 0 \ < 1 because N > 2. If c\ = c 2 ■ • • = 
c n = 0, then we choose 6 = 1. Otherwise we choose 

min{(2-|a B l):n = l,2,... > jV} 
_ max{2|c„| : n = 1,2, . . . , N} 

Then it follows that 

d(w n (x u yi), w n (x 2 , y 2 )) < (\a n \ +0\c n \)\x l - x 2 \ + 0\d n \\y x - y 2 \ 

< a\x x - x 2 \ + G8\yi - y 2 \ 

< max{fl, 8}d((x u y0, (x 2 , y 2 )), 

where 

max{|fl„| : n = 1, 2, . . . , N} 
a = (1 + a n ) < 1, 

5=max{|rf„| :n = 1,2, . . . , N} < 1. 
This completes the proof. 

Theorem 2.2 Let N be a positive integer greater than 1. Let {IR 2 ; w n ,n = 
1, 2, . . . , N} denote the IFS defined above, associated with the data set {(x n , F n ) : 
n = 1, 2, . . . , N}. Let the vertical scaling factor d n obeyO <d n <l forn = 1,2,..., 
N, so that the IFS is hyperbolic. Let G denote the attractor of the IFS. Then G is 
the graph of a continuous function f : [xo,x^] — > IR, which interpolates the data 
{(Xi, Fi) : i = 1,2, . . . , N}.That is, 

G = {(x, f(x)) : x e [x 0 , x N ]}, 

where 

f( Xi ) = Fi for/=0, 1,2, 3, ...,N. 

Proof Let T denote the set of continuous functions / : [xo, x\] — > IR such that 
f(xo) = F 0 and f(x N ) = F N .We define a metric d on T by 

d(f, g) = max{\f(x) - g(x)\ : x e [x 0 ,x N ]} for all /, g in T. 

Then {T, d) is a complete metric space — see, for example [Rudin 1966], or prove it 
yourself. 
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Let the real numbers a n , c„, e n , /„, be denned by equations 1, 2, 3, and 4. Define 
a mapping T : T T by 

(r/)(x) = cZ-H*) + d n f{r n \x)) + /„ for x 6 [*„-i, *„], for n = 1, 2, . . . , N, 
where /„ : [x 0 , x N ] -+ [*„-i, *„] is the invertible transformation 

/„(x) = a„x +e n . 

We verify that T does indeed take T into itself. Let / e T. Then the function 
(Tf)(x) obeys the endpoint conditions because 

(Tf)(x 0 ) = ci/fH^o) + dif(li\x 0 )) + fn 

= cix 0 + dif(x 0 ) + fn = cix 0 + diF 0 + /„ 

= F 0 



and 



(r/)(jc w ) = c N l N \x N ) + d N f(! N \x N )) + f N 
= c N x N + d N f{x N ) + fN 

= CnXn + ds^N- 

The reader can prove that {Tf){x) is continuous on the interval [x„_i, *«] for n = 
12 N Then it remains to demonstrate that (r/)(x) is continuous at each 
of the points xi. x 2 . x* . . . . At each of these points the value of ■ (T/)(x) is 
apparently defined in two different ways. For n s {1, 2, . . . , N - 1} we have 

(Tf){X H ) = Cn+ltiiM + d n+ lf(i;U(x n )) + fn+l 

= C n+ lXo + d„+i/(xo) + = ^« 

and also 

(Tf){Xn) = cj; l (x n ) + d n f{l-\x n )) + /„ 

= C n XN + d n f{XN) + fn = F n , 

so both methods of evaluation lead to the same result. We conclude that T does 

indeed take T into T. T 
Now we show that T is a contraction mapping on the metric space (T, d). Let 

/, g 6 T. Let n 6 {1, 2, . . . , N) and x 6 [x„-i, x„]. Then 

|(r/)(jc) _ (rg )(x)i = Kii/a- 1 W) - 1 ^ w« *>■ 

It follows that 

d(Tf Tg) < g) where 3 = max{|d„| : n = 1, 2, . . . , N} < 1. 
We conclude that T : ^ - ^ is a contraction mapping. The Contraction Mapping 
Theorem implies that T possesses a unique fixed point in T. That is, there exists a 
function / e J 7 such that 

"V/)(*) = for ^ x 6 t* 0, Xn ^' 
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Figure VI. 165. A 

sequence of functions 
{/„+,(*) = (Tf n )(x)} 
converging to the fixed 
point of the mapping 
T : T -* T used in the 
proof of Theorem 2.2. 
This is another example 
of a contraction mapping 
doing its work. 




The reader should convince himself that / passes through the interpolation points. 

Let G denote the graph of /. Notice that the equations that define T can be 
rewritten 

(Tf)(a n x + e n ) = c n x + d n f(x) + f n for x e [x 0 , x N ], for n = 1, 2, . . . , N, 
which implies that 



But G is a nonempty compact subset of IR 2 . By Theorem 2.1 there is only one 
nonempty compact set G, the attractor of the IFS, which obeys the latter equation. It 
follows that G = G. This completes the proof. 

Definition 2.2 The function f(x) whose graph is the attractor of an IFS as 
described in Theorems 2.1 and 2.2, above, is called a fractal interpolation function 
corresponding to the data {(*,•, F, ) : / = 1, 2, . . . , N). 

Figure VI. 165 shows an example of a sequence of iterates {T on fo :n=0, 1, 2, 3, 
. . .} obtained by repeated application of the contraction mapping T, introduced in 
the proof of Theorem 2.2. The initial function f 0 (x) is linear. The sequence con- 
verges to the fractal interpolation function /, which is the fixed point of T. Notice 
that the whole image can be interpreted as the attractor of an IFS with condensation, 
where the condensation set is the graph of the function fo(x). 

The reader may wonder, in view of the proof of Theorem 2.2, why we go to the 
trouble of establishing that there is a metric such that the IFS is contractive. After all, 
we could simply use T to construct fractal interpolation functions. The answer has 
two parts, (a) and (b). (a) We can now apply the theory of hyperbolic IFS to fractal 
interpolation functions. Of special importance, this means that we can use IFS algo- 
rithms to compute fractal interpolation functions, that the Collage Theorem can be 
used as an aid to finding fractal interpolation functions that approximate given data, 



G = U^ lWn (G). 
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and that we can use the Hausdorff metric to discuss the accuracy of approximation of 
experimental data by a fractal interpolation function, (b) By treating fractal interpo- 
lation functions as attractors of IFS of affine transformations we provide a common 
language for the description of an important class of functions and sets: the same 
type of formula, namely an IFS code, can be used in all cases. 

One consequence of the fact that the IFS {R; w n , n = 1,2, . . . , N} associated 
with a set of data {(*„, F n ) : n = 1,2, . . . , N] is hyperbolic is that any set A 0 € 
H(R 2 ) leads to a Cauchy sequence of sets {A n } that converges to G in the Hausdorff 
metric. In the usual way we define W : H(R 2 ) -» H(R 2 ) by 

W(B) = VJ N n=l w n (B) for all B € H(R 2 ). 

Then {A n = W on (Ao)} is a Cauchy sequence of sets which converges to G in the 
Hausdorff metric. This idea is illustrated in Figure VI. 166. Notice that if A 0 is the 
graph of a function / 0 ef then A n is the graph of T° n fo. 

Examples & Exercises 

2.10. Prove that the metric on R 2 introduced in the proof of Theorem 2.1 is equiva- 
lent to the Euclidean metric on R 2 . 

2.1 1 . Use the Collage Theorem to help you find a fractal interpolation function that 
approximates the function whose graph is shown in Figure VI. 167. 

2.12. Write a program that allows you to use the Deterministic Algorithm to com- 
pute fractal interpolation functions. 

2.13. Explain why Theorems 2.1 and 2.2 have the restriction that N is greater 
than 1. 

2.14. Let a set of data {(*,-, F t ) : i = 0, 1, 2, . . . , N} be given. Let the metric space 
{T, d) and the transformation T : T -» T be defined as in the proof of Theorem 2.2. 
Prove that if / € T then Tf is an interpolation function associated with the data. 
Deduce that if / e T is a fixed point of T then / is an interpolation function 
associated with the data. 

2.15. Make a nonlinear generalization of the theory of fractal interpolation func- 
tions. For example, consider what happens if one uses an IFS made up of nonlinear 
transformations w n : R 2 -> R 2 of the form 

u)„(x, y) = (a n x + e n , c n x + d n y + g n y 2 + fn), 

where a n ,c n ,d n ,g n , and /„ are real constants. This example uses "quadratic scaling" 
in the vertical direction instead of linear scaling. Determine sufficient conditions for 
the IFS to be hyperbolic, with an attractor that is the graph of a function that inter- 
polates the data {(*,-, F,) : i = 0, 1, 2, . . . , N}. Note that in certain circumstances the 
IFS generates the graph of a differentiable interpolation function. 

2.16. Let f(x) denote a fractal interpolation function associated with a set of data 
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Figure VI. 166. Exam- 
ples of the convergence 
of a sequence of sets {A n } 
in the Hausdorff metric, 
to the graph of a fractal 
interpolation function. 





© 
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Figure VI. 167. Use 

the Collage Theorem to 
find an IFS {U 2 ; w u w 2 }, 
where W\ and w 2 are 
shear transformations 
on K 2 , such that the 
attractor of the IFS is a 
good approximation to 
the graph of the function 
shown here. 



{(jc ( , Fi) : i = 0, 1, 2, . . . , N}, where N > 1. Let the metric space (J 7 , d) and the 
transformation T : T -> T be defined as in the proof of Theorem 2.2. The functional 
equation Tf = f can be used to evaluate various integrals of /. As an example 
consider the problem of evaluating the integral 

1= / f(x)dx. 

Jxo 

The integral is well defined because f(x) is continuous. We have 
1=1 (Tf)(x)dx = J2 ( T f^ x> > dx 

N px N 

= J2 / (c n x + d n f(x) + f n )d(a n x + e n ) =al + p, 

n = l ^0 

where 

N N i>x N 

a = C^ j a n d n ) and p = J2 a * (c n x + f n )dx. 
Show that, under the standard assumptions, |a| < 1. Show also that 

fX N 

0= / f 0 (x)dx, 

Jxo 
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Figure VI. 168. Illus- 
tration of the geometrical 
viewpoint concerning the 
integration of fractal inter- 
polation functions. 



(total area) =( Det(Aj)+Det (A 2 ))x(total area) + B 




where f 0 (x) is the piecewise linear interpolation function associated with the data. 
Conclude that 

C XN 6 
/ f(x)dx = — ^~ 

Check this result for the case of the parabola, described in Exercise 2.4. In Fig- 
ure VI. 168 we illustrate a geometrical way of thinking about the integration of a 
fractal interpolation function. 

2. 1 7. Let f{x) denote a fractal interpolation function associated with a set of data 
{(*,-, Fi) : i = 0, 1, 2, . . . , AT}, where N > 1. By following similar steps to those in 
exercise 2.16, find a formula for the integral 

h = I xf(x)dx. 

Jx 0 

Check your formula by applying it to the parabola described in exercise 2.4. 

2.18. Figure VI. 169 shows a fractal interpolation function together with a zoom. 
Can you reproduce these images and then make a further zoom? What do you expect 
a very high-magnification zoom to look like? 
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Figure VI. 169. A frac- 
tal interpolation function 
together with a zoom. If 
the fractal dimension is 
equal to 1, what do you 
expect "most" very high 
magnification zooms to 
look like? 



3 The Fractal Dimension of Fractal Interpolation Functions 

The following excellent theorem tells us the fractal dimension of fractal interpolation 
functions. 

Theorem 3.1 Let N be a positive integer greater than 1. Let {(*„, F n ) € D& 2 : 
n = 1, 2, . . . , N} be a set of data. Let {R 2 ; w n , n = 1, 2, . . . , N) be an IFS associ- 
ated with the data, where 

-(;)=(: :)(;HD -»=^ -»- 

The vertical scaling factors d n obey 0 < d n < I; and the constants a n ,c n ,e„, and f n 
are given by equations 1,2,3, and 4, for n = 1,2, . . . , N . Let G denote the attractor 
of the IFS, so that G is the graph of a fractal interpolation function associated with 
the data. If 

N 

X>*i>i (i) 

n=l 

■4 

and the interpolation points do not all lie on a single straight line, then the fractal 
dimension of G is the unique real solution D of 
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Figure VI. 170. The 

graph G of a fractal 
interpolation function is 
superimposed on a grid 
of closed square boxes 
of side length e. is 
used to denote the number 
of boxes that intersect 
G. What is the value of 
JV(0? 



j2\dn\at X = I- 

n=\ 

Otherwise the fractal dimension ofG is 1. 

Proof (Informal Demonstration). The formal proof of this theorem can be 
found in [Barnsley 1988f]. Here we give an informal argument for why it is true. We 
use the notation in the statement of the theorem. 

Let 6 > 0. We consider G to be superimposed on a grid of closed square boxes of 
side length e, as illustrated in Figure VI. 170. Let A/"(e) denote the number of square 
boxes of side length e which intersect G. These boxes are similar to the ones used 
in the Box Counting Theorem, Theorem 1.2 in Chapter V, except that their sizes are 
arbitrary. On the basis of the intuitive idea introduced in Chapter V, section 1, we 
suppose that G has fractal dimension D, where 

N(e) ^ constant • e~ D as e -» 0. 

We want to estimate the value of D on the basis of this assumption. 

Let n e {1,2, . . . , N}. Let M n (e) denote the number of boxes of side length e 
which intersect w n (G) for n = 1, 2, . . . , N. We suppose that 6 is very small com- 
pared to \x N — x 0 \. Then because the IFS is just-touching it is reasonable to make 
the approximation 

N{€) « MO) + + A/" 3 (e) + • • • + Af N (€). (2) 
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Figure VI. 171. The 

boxes that intersect G can 
be thought of as organized 
in columns. The set of 
columns of boxes of side 
length e which intersect G 
is denoted by {c,-(e) : j = 
1,2, ...,/C(e)}. wh ere 
/C(e) denotes the number 
of columns. What is the 
value of /C(e) and how 
many boxes are there in 
c 2 (e), i n this illustration? 



We now look for a relationship between M(e) and A/" n (e). The boxes that intersect G 
can be thought of as being organized into columns, as illustrated in Figure VI. 171. 

Let the set of columns of boxes of side length e which intersect G be denoted 
by {cj(e) : = 1, 2, . . . , /C(e)}, where /C(e) denotes the number of columns. Under 
the conditions in equation 1, in the statement of the theorem, one can prove that the 
minimum number of boxes in a column increases without limit as e approaches zero. 
To simplify the discussion we assume that 

|4| >a n forn = 1,2, N. 

(Notice that 

N N (y — v ^ 

which tells us that this assumption is stronger than the assumption \ d *\ > ^) 
Then consider what happens to a column ofboxes cj(e) of side length e when we 
apply the affine transformation w n to it. It becomes a column of parallelograms. The 
width of the column is a n e and the height of the column is \d n \ times the height of 
the column before transformation. Let J\f(Cj(e)) denote the number of boxes in the 
column cj(e). Then the column w n (Cj(€)) can be thought of as being made up of 
square boxes of side length a n e, each of which intersects w n (G). How many boxes 
of side length a n e are there in this column? Approximately \d n \M(c(€))/a n . Adding 
up the contribution to N n (a n e) from each column we obtain 

KM „ g m^ML = 1*1 X>(c, (e )) = !*W(o. 

a n a " j^i a " 

The situation is illustrated in Figure VI. 172. 
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Figure VI. 172. When 
the shear transforma- 
tion w\ is applied to the 
columns of boxes which 
cover the graph, G, the 
result is a set of thinner 
columns, of width a\€, 
which cover W\(G). The 
new columns are made up 
of small parallelograms, 
but the number of square 
boxes of side length a\€ 
which they contain is read- 
ily estimated. 




From the last equation we deduce that when e is very small compared to [x 0 , x N ], 
K(€) « — M{ — ) for n = 1, 2, . . . , N. (3) 
We now substitute from equation 3 into 2 to obtain the functional equation 

a\ a\ a 2 a 2 a 3 a 3 a N a N 

Into this equation we substitute our assumption M{e) & constant e~ D to obtain the 
equation 

e- D * \d x \ax D -^- D + \d 2 \a 2 D - l e~ D + \d 3 \a 3 D - l € - D + ■■■ + \d N \a N D -'e~ D . 

The main formula in the statement of the theorem follows at once. 

If the interpolation points are collinear, then the attractor of the IFS is the line 
segment that connects the point (x 0 , F 0 ) to the point (x N , F N ), and this has fractal 
dimension 1. If J2n=i \dn\<l, then one can show that Af(e) behaves like a constant 
times e -1 , whence the fractal dimension is 1. This completes our informal demon- 
stration of the theorem. 

Examples & Exercises 

3.1. We consider the fractal dimension of a fractal interpolation function in the 
case where the interpolation points are equally spaced. Let jc ; = xo + jj (x N — x 0 ) 
for i = 0, 1, 2, . . . , N. Then it follows that a n = ^ for n = 1, 2, . . . , N. Hence if 
condition (1) in Theorem 3.1 holds then the fractal dimension D of the interpolation 
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function obeys 




D-l 



(^"-E 14.1 = 1- 

n = l 



It follows that 



D= 1 + 



log 



(e^Uri) 



logW 



This is a delightful formula for reasons of two types, (a) and (b). (a) This formula 
confirms our understanding of the fractal dimension of fractal interpolation func- 
tions. For example, notice that J2n=i M < N - Hence the dimension of a fractal 
interpolation function is less than 2: however, we can make it arbitrarily close to 2. 
Also, under the assumption that Yln=i > the fractal dimension is greater than 
1: however, we can vary it smoothly down to 1. (b) It is remarkable that the fractal 
dimension does not depend on the values {F, ; : / = 0, 1, 2, . . . , N}, aside from the 
constraint that the interpolation points be noncollinear. Hence it is easy to explore 
a collection of fractal interpolation functions, all of which have the same fractal di- 
mension, by imposing the following simple constraint on the vertical scaling factors: 



Figure VI. 173 illustrates some members of the family of fractal interpolation 
functions corresponding to the set of data {(0, 0), (1, 1), (2, 1), (3, 2)}, such that the 
fractal dimension ofeach member of the family is D = 1 .3. 

Figures 173 (a) and (c) illustrate members of a family of fractal interpolation 
functions parameterized by the fractal dimension D. Each function interpolates the 
same set of data. 

3.2. Make an experimental estimate of the fractal dimension of the graphical data 
in Figure VI. 175. Find a fractal interpolation function associated with the data 
{(0, 0), (50, 50), (100, 0)}, which has the same fractal dimension and two equal ver- 
tical scaling factors. Compare the graph of the fractal interpolation function with the 
graphical data. 

3.3. Find a fractal interpolation function that approximates the experimental data 
shown in Figure VI 47. 

3.4. Figure VI. 176 shows the graphs of functions belonging to various one-para- 
meter families of fractal interpolation functions. Each graph is the attractor of an 
IFS consisting of two affine transformations. Find the IFS associated with one of the 
families. 



N 



D-l 
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Figure VI. 173. Mem- 
bers of the family of frac- 
tal interpolation functions 
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(c) 



4 Hidden Variable Fractal Interpolation 

We begin by generalizing the results of section 6.2. Throughout this section, let 
(Y, d Y ) denote a complete metric space. 

Definition 4. 1 Let /cR. Let f : I ->Y be a function. The graph off is the set 

of points 

G = {(*, /(*)) e R x Y:xel}. 

Definition 4.2 A set of generalized data w a set of points of the form {(*,-, F,) e 
R x Y : i = 0, 1, 2, . . . , iV}, where 

X 0 < Xi < X 2 < X3 < • ■ • < X N . 

An interpolation function corresponding to this set of data is a continuous function 
f : [x 0 , x N ] -> Y such that 

f( Xi ) = Fi fori = 1,2, ...,N. 

The points (x { , F,) e R x Y are called the interpolation points. We say that the func- 
tion f interpolates the data and that (the graph of) f passes through the interpola- 
tion points. 

Let X denote the Cartesian product space IxY. Let 9 denote a positive number. 
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Figure VI. 174. Mem- 
bers of a one-parameter 
family of fractal interpo- 
lation functions. They 
correspond to the set 
of data {(0,0), (1, 1), 
(2, 1), (3, 2)} with ver- 
tical scaling factors 
d x = -d 2 = d 3 = 3 d " 2 
for D = 1, 1.1, 1.2, 
1.3, 1.4, 1.5, 1.6, and 1.7. 
D is the fractal dimension 
of the fractal interpolation 
function. 
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Figure VI. 175. Make 

an experimental estimate 
of the fractal dimension of 
the graphical data shown 
here. Find a fractal inter- 
polation function associ- 
ated with the data {(0, 0), 
(50,-50), (100,0)}, 
which has the same fractal 
dimension and two equal 
vertical scaling factors. 
Compare the graph of the 
fractal interpolation func- 
tion with the graphical 
data. 




Figure VI. 176. This figure shows the graphs of various one-parameter families of 
interpolation functions. Each graph is the attractor of an IFS consisting of two afi 
transformations. Can you find the families? 
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Define a metric d on X by 

d(X u X 2 ) = \xi - x 2 \ + Od Y (yi, y 2 ), (1) 

for all points Xi = (x u y\) and X 2 = (x 2 , yi) in X. then (X, d) is a complete metric 
space. 

Let N be an integer greater than 1. Let a set of generalized data {Or,-, F/)eX:i = 

0, 1, 2, . . . , N] be given. Letn e {1, 2, . . . , TV}. Define L n : R -+ R by 

r , . . (x n —X n -i) (x N X n _i — XQX n ) 

L n (x) = a n x + e n where a n = — — and e„ = . (2) 

{x N — Xo) (Xn — Xo) 

so that L n ([xo, x N ]) = [x n -i,x n ]. Let c and s be real numbers, with 0 < s < 1 and 
c > 0. For each n e { 1 , 2, . . . , /V} let M n : X Y be a function that obeys 

d(M n (a, y), M n (b, y)) < c\a - b\ for all a,beR, (3) 

and 

d(M n (x, a), M n (x, b)) < sd Y (a, b) for all a, b e Y. (4) 
Define a transformation w n : X X by 

Wn (x, y) = (L n (x), M n (x, y)) for all (x,y)eX,n = l,2 TV. 

Theorem 4.1 Let the IFS {X; w n , n = 1, 2, . . . , /V} be defined as above with 
N > 1 . In particular, assume that there are real constants c and s such that 0 < s < 

1, 0 < c, and conditions 3 and 4 are obeyed. Let the constant 0 in the definition of 
the metric d in equation 1 be defined by 

9 = — — — where a — max{a ( - : i = 1, 2, . . . , /V}. 
2c 

Then the IFS {X; w„, n = 1, 2, . . . , /V} is hyperbolic with respect to the metric d. 

Proof This follows very similar lines to the proof of Theorem 2.1. We leave it as 
an exercise for enthusiastic readers. The proof can also be found in [Barnsley 1986]. 

We now constrain the hyperbolic IFS {X; w n , n = 1, 2, . . . , /V}, defined above, to 
ensure that its attractor includes the set of generalized data. We assume that 

M n (x 0 , F 0 ) = F n _! and M n (x N , F N ) = F n for n = 1, 2, . . . , N. (5) 

Then it follows that 

w n (x 0 , F 0 ) = (x»-\, F„_i) and w„(x N , F N ) = (x„, F n ) for n = 1, 2, . . . , TV. 

Theorem 4.2 Let N be a positive integer greater than I. Let {X; w n ,n = 
1,2, . . . , N} denote the IFS defined above, associated with the generalized data set 
{(Xi, Fj) e R x Y : i — 1,2, . . . , N}. In particular, assume that there are real con- 
stants c and s such that 0<s<l,0<c, and conditions 3, 4, and 5 are obeyed. Let 
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G e H(X) denote the attractor of the IFS. Then G is the graph of a continuous func- 
tion f : [x 0 , x N ] Y which interpolates the data {(*,, F,) : / = 1, 2, . . . , N}. That 
is, 

G = {(x, f(x)) : x e [x 0 , x N ]}, 

where 

f( Xi ) = Fi for j=0, 1,2,3, ...,N. 

Proof Again we refer to [Barnsley 1988f]. The proof is analgous to the proof of 
Theorem 2.2. 

Definition 4.3 The function whose graph is the attractor of an IFS, as described 
in Theorems 4.1 and 4.2, above, is called a generalized fractal interpolation function, 
corresponding to the generalized data {(*, , F,) : / = 1, 2, . . . , N}. 

We now show how to use the idea of generalized fractal interpolation functions 
to produce interpolation functions that are more flexible than heretofore. The idea is 
to construct a generalized fractal interpolation function, using affine transformations 
acting on R 3 , and to project its graph into R 2 . This can be done in such a way that 
the projection is the graph of a function that interpolates a set of data {(*, , F,) e 
R 2 : i = 1, 2, . . . , N}. The extra degrees of freedom provided by working in R 3 
give us "hidden" variables. These variables can be used to adjust the shape and 
fractal dimension of the interpolation functions. The benefits of working with affine 
transformations are kept. 

Let N be an integer greater than 1. Let a set of data {(*,-, F,) e R 2 : i = 0, 1, 2, 
N} be given. Introduce a set of real parameters {//, : / = 0, 1, 2, . . . , N}. For 
the moment let us suppose that these parameters are fixed. Then we define a gen- 
eralized set of data to be {(*,-, F,-, Hi) e R x R 2 : j = 0, 1, 2, . . . , N}. In the present 
application of Theorem 4.2 we take (Y, d Y ) to be (R 2 , Euclidean). We consider an 
IFS {R 3 ; w n ,n = l,2,..., N}, where for n e {1, 2, . . . , N} the map w n : R 3 R 3 
is an affine transformation of the special structure: 





~ X ~ 




~a n 
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0 " 
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Here a n , c n , d n , e n , /„, g n , h n , K, /„, and m„ are real numbers. We assume that they 
obey the constraints 
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Fn 
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, forn = 1,2, N. 



Then we can write 



Wn (x, y, z) = (L n (x), M n (x, y, z)) for all (x, y, z) e R 3 , n = 1, 2, 



N, 



where L n (x) is defined in Equation 2 and M n : 



is defined by 



where 



A„ = 



= A n 



+ 



fn'~^~ CnX 
8n k n x 



d„ h n 
In m n 



for n = 1,2, . . . , N . 



(6) 



Let us replace F„ in condition 5 by (F n , H n ). Then M n obeys condition 5. Let us 
define 

c = max{max{c,, fc,} : / = 1, 2, . . . , N}. 

Then condition 3 is true. Lastly, assume that the linear transformation A n : R 2 — > R 2 
is contractive with contractivity factor s with 0 < s < 1. Then condition 4 is true. 
We conclude that, under the conditions given in this paragraph, the IFS {R 3 ; w n , n = 
1,2, . . . , N} satisfies the conditions of Theorem 4.2. It follows that the attractor of 
the IFS is the graph of a continuous function / : [xq, x^] — > R 2 such that 



Now write 



Then fx : [x 0 , x N ] 



f(x i ) = {F i ,H i )fovi = \,2,...,N. 

f(x) = (fi(x),f 2 (x)). 
[ is a continuous function such that 
fi(x() = Fi for / = 1,2, ...,N. 



Definition 4.4 The function f\ : [xo, x^] — > R 2 constructed in the previous 
paragraph is called a hidden variable fractal interpolation function, associated with 
the set of data {(xt, F t ) e R 2 : i = 1, 2, . . . , N}. 

The easiest method for computing the graph of a hidden variable fractal interpo- 
lation function is with the aid of the Random Iteration Algorithm. Here we present 
an adaptation of Program 1. It computes points on the graph of a hidden variable 
fractal interpolation function and displays them on a graphics monitor. It is written 
for N = 3 and the data set 



{(0, 0), (30, 50), (60, 40), (100, 10)}. 
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The "hidden" variables, namely the entries of the matrices A n and the number H n 
for n = 1, 2, 3, are input by the user during execution of the code. The program 
calculates' the coefficients in the three-dimensional affine transformations from the 
data and then applies the Random Iteration Algorithm to the resulting IFS. The first 
two coordinates of each successively computed point, which has three coordinates, 
are plotted on the screen of the graphics monitor. The program is written in BASIC. 
It runs without modification on an IBM PC with Enhanced Graphics Adaptor and 
Turbobasic. On any line the words preceded by a ' are comments: they are not part 
of the program. 

Program 2. 

x[0] =0 : x[l] =30 : x[2] =60 : x[3] =100 'Data set 
F[0] =0 : F[l] =50 : F[2] =40 : F[3] =10 

input "enter the hidden variables H[0] , H[l] , H[2] and H[3]", 
H [1] , H [2] , H [3] , H [4] ' Hidden Variables 

for n = 1 to 3 : print "for n = ", n 

input "enter the hidden variables d, h, 1, m" , 

d[n] ,hh[n] ,l[n] ,m[n] 'More Hidden Variables 

next 

for n = 1 to 3 'Calculate the affine transformations from 
the Data and the Hidden Variables 

p = F[n-l]-d[n]*F[0]-hh[n]*H[0] : q = H[n-1] -1 [n] *F[0] -m[n] *H[0] 
r = F[n]-d[n]*F[3]-hh[n]*H[3] : s = H [n] -1 [n] *F [3] -m [n] *H [3] 

b = x[3]-x[0] : c[n] = (r-p)/b : k[n] = (s-q)/b 
a[n] = (x[n]-x[n-l])/b : e[n] = (x [3] *x [n-l] -x [0] *x [n] ) /b 
ff [n] = p-c[n]*x[0] : g[n] = q-k [n] *x [0] 
next 

screen 2 : els ' initalialize graphics 
window(0,0)-(100,100) 'change this to zoom and/or pan 
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x =0: y =0: z =hh[0] 'initial point from which the random 
iteration begins 

for n = 1 to 1000 'Random Iteration Algorithm 

kk = int(3*rnd-0.0001)+l 

newx = a[kk]*x + e[kk] 

newy = c[kk]*x + d[kk]*y + hh[kk]*z + f f [kk] 
newz = k[kk]*x + l[kk]*y + m[kk]*z + g[kk] 
x = newx : y = newy: z = newz 

pset(x,y),z 'plot the most recently computed point, in 
color z, on the screen 

next 
end 

The result of running an adaptation of this program on a Masscomp workstation 
and then printing the contents of the graphics screen is presented in Figure VI. 177. In 
this case H[0] = 0, H[l] = 30, H[2] = 60, H[3] = 100, d(l) = d(2) = d(3) = 0.3, 
h(\) = h(2) = 0.2, h(3) = 0.1, 1(1) = 1(2) = 1(3) = -0.1, m(l) = 0.3, m(2) = 0, 
m (3) _ _o.i. Remember that the linear transformation A n must be contractive, so 
certainly do not enter values of magnitude larger than 1 for any of the numbers 
d(n), h(n ),l(n), and m(n). The program renders each point in a color that depends 
on its z-coordinate. This helps the user to visualize the "hidden" three-dimensional 
character of the curve. 

The important point about hidden variable fractal interpolation is this. Although 
the attractor of the IFS is a union of affine transformations applied to the attractor, 
this is not the case in general when we replace the word "attractor" by the phrase 
"projection of the attractor." The graph of the hidden variable fractal interpolation 
function /i(jc) is not self-similar, or self-affine, or self-any thing! 

The idea of hidden variable fractal interpolation functions can be developed using 
any number of "hidden" dimensions. As the number of dimensions is increased, the 
process of specifying the function becomes more and more onerous, and the function 
itself, seen by us in flatland, becomes more and more random. One would never 
guess, from looking at pictures of them, that they are generated by deterministic 
fractal geometry. 
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Y 



Figure VI. 177. An ex- 
ample of a hidden vari- 
able fractal interpola- 





0.3, h(\) =h(2) = 0.2, 
/z(3) = 0.1,/(l)=/(2) = 
/(3) = -0.1, m(\) = 0.3, 
m(2) = 0,m(3) = -0.1. 



X 



Examples & Exercises 

4. 1 . Generalize the proof of Theorem 2. 1 to obtain a proof of Theorem 4. 1 . 

4.2. Let T denote the set of continuous functions f:[x 0 ,x N ]-+Y such that 

f(x 0 ) = F 0 and f(x N ) = F N . 

Define a metric d on T by 

d(f, g) = max{d Y (f(x), g(x)) : x e [x Q , x N ]}. 

Then d) is a complete metric space; see, for example [Rudin 1966]. Use this fact 
to help you generalize the proof of Theorem 2.2 to provide a proof of Theorem 4.2. 

4.3. Rewrite Program 2 in a form suitable for your own computer environment, then 
run it and obtain hardcopy of the output. 

4.4. Modify your version of Program 2 so that you can adjust one of the "hidden" 
variables while it is running. In this way, make a picture that shows a one-parameter 
family of hidden variable fractal interpolation functions. 

4.5. Modify your version of Program 2 so that you can see the projection of the 
attractor of the IFS into the (y, z) plane. To do this simply plot (y, z) in place of 
(x, y). Make hardcopy of the output. 

4.6. Figure VI. 178 shows three projections of the graph G of a generalized fractal 
interpolation function / : [0, 1] [R 2 . The projections are (i) into the (x, y) plane, 
(ii) into the (x, z) plane, and (iii) into the (y, z) plane. G is the attractor of an IFS of 
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the form {R 3 ; w u w 2 }, where w\ and w 2 are afflne transformations. Find w x and w 2 . 
See also Figure VI. 179. 

4.7. Use a hidden-variable fractal interpolation function to fit the experimental data 
in Figure V.147. Here is one way to proceed, (a) Modify your version of Program 
6.4. 1 so that you can adjust the "hidden" variables from the keyboard, (b) Trace 
the data in the Figure V.147 onto a sheet of flexible transparent material, such as a 
viewgraph. (c) Attach the tracing to the screen of your graphics monitor using clear 
sticky tape, (d) Interactively adjust the "hidden" variables to provide a good visual 
fit to the data. 

4.8. *Show that, with hidden variables, one can use affine transformations to con- 
struct graphs of polynomials of any degree. 



5 Space-Filling Curves 

Here we make a delightful application of Theorem 4.2. Let A denote a nonempty 
pathwise-connected compact subset of R 2 . We show how to to construct a continu- 
ous function / : [0, 1] -> R 2 such that f([0, 1]) = A. 

Let (Y,d Y ) denote the metric space (K 2 , Euclidean). We represent points in Y us- 
ing a Cartesian coordinate system defined by a y-axis and a z-axis. Thus, (y, z) may 
represent a point in Y. To motivate the development we take A = m C Y. Consider 
the just-touching IFS {Y; w u w 2 , w 3 , w 4 }, where the maps are similitudes of scaling 
factor 0.5, corresponding to the collage in Figure VI. 180. 

Let 

(F 0 , H 0 ) = (0,0), (F l ,H l ) = (0,0.5), (F 2 , H 2 ) = (0.5, 0.5), 
(F 3 , H 3 ) = (1,0.5), and 
(F 4 ,// 4 ) = d,0). 
The maps are chosen so that 

w n (F 0 , H 0 ) = (F„_!, H n -i) and w n (F A , H A ) = (F n , H n ) for n = 1, 2, 3, 4. 

The IFS code for this IFS is given in Table VI. 1. 

Let Ao g ^(") denote a simple curve that connects the point (F 0 , H 0 ) to the point 
(F 4 , H 4 ), such that A 0 n3i= {(F 0 , H 0 ), (F 4 , H 4 )}. This last condition says that the 
curve lies in the interior of the unit square box, except for the two endpoints of the 
curve. Consider the sequence of sets {A n = W on (A 0 )}~ 0 where W : n ^> ~+ 
is defined by 

W(B) = U 4 n=l w n (B) for all B e H{m). 

It follows from Theorem 7.1 in Chapter III that the sequence converges to ■ in 
the Hausdorff metric. The reader should verify that, for each n = 1, 2, . . . , A n is a 
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Figure VI. 178. This 
figure shows three pro- 
jections of the graph 
of a generalized frac- 
tal interpolation function 
/:[0, l]-> U 2 . The pro- 
jections are into the (x, y) 
plane, the (x, z) plane, and 
the (y, z) plane. G is the 
attractor of an IFS of the 
form wi, w 2 ], where 
u>i and w 2 are affine trans- 
formations. Can you find 
W\ and u; 2 ? 



Figure VI. 179. Three 
orthogonal projections of 
the graph of a general- 
ized fractal interpolation 
function. The fractal di- 
mension here is higher 
than for Figure VI 178. 





J 
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Figure VI. 180. Col- 
lage of ■ using four simil- 
itudes of scaling factor 
0.5. The map w„ is cho- 
sen so that w n (Fo, H 0 ) = 
(F„- U H n -i) and 
u>„(F 4 , H 4 ) = (F„, H n ) for 
n = 1,2, 3,4. 
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(F 2 ,H 2 ) 
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(F3,H 3 ) 
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(F 4 ,H 4 ) 



Table VI. 1 . IFS code for ■, constrained to yield a space-filling curve. 
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simple curve that connects the points (F 0 , H 0 ) to the point (F 4 , H 4 ). Sequences of 
such curves are illustrated in Figures VI.181-VI.184. 

We use the IFS defined in the previous paragraph to construct a continuous func- 
tion / : [0, 1] ■ such that /([0, 1]) = ■. We achieve this by exploiting a hidden 
variable fractal function constructed in a special way. We use ideas presented in 
Chapter VI, section 4. Consider the IFS {R 2 ; w n , n = 1, 2, . . . , N}, where the map 
w n : R 3 OS. 3 is the affine transformation 
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The constants a n , b n , c n , d n , e n , and /„ are defined in Table VI. 1. This IFS satisfies 
Theorem 4.2, corresponding to the set of data 

{(0, F 0 , H 0 ), (0.25, F u Hi), (0.5, F 2 , H 2 ), (0.75, F 3 , H 3 ), (1, F 4 , H 4 )}. 

It follows that the attractor of the IFS is the graph, G, of a continuous function 
y . [0, l] [R 2 . What is the range of this function? It is 
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Figure VI. 181. A se- 
quence of curves "con- 
verging to" a space-filling 
curve. These are obtained 
by application of the De- 
terministic Algorithm to 
the IFS code in Table VI. 1 , 
starting from a curve A 0 , 
which connects (0, 0) to 
(1,0) and lies in ■. 




Figure VI. 182. A 

higher-resolution view 
of one of the panels in 
Figure VI. 184. How long 
is the shortest path from 
the lower left corner to the 
lower right corner? 
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Figure VI. 183. A se- 
quence of sets "converging 
to" a ■. These are obtained 
by application of the De- 
terministic Algorithm to 
the IFS code in Table VI. 1, 
starting from the set A 0 in 
the lower left panel. How 
fascinating they are! 
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Figure VI. 184, A se- 
quence of curves "con- 
verging to" a space-filling 
curve. These are obtained 
by application of the De- 
terministic Algorithm to 
the IFS code in Table VI. 1, 
starting from a curve A 0 , 
which connects (0, 0) to 
(1,0) and lies in ■. 
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G yz = {(y,z)eR 2 :(x,y,z)eG}, 

namely the projection of G into the (y, z) plane. It is straightforward to prove that 
G yz is the attractor A = m of the IFS defined by the IFS code in Table VI. 1. It follows 
that / ([0, 1]) = ■. So we have our space-filling curve! 

We have something else very exciting as well. The attractor of the three-dimen- 
sional IFS is the graph of a function from [0, 1] to ■. The projections G xy and G xz , in 
the obvious notation, are graphs of hidden-variable fractal functions, while G yz = ■. 
What does G look like from other points of view? Various views of the attractor are 
illustrated in Figures VI. 185 and VI. 186. We conclude that G is a curious, complex, 
three-dimensional object. It would be wonderful to have a three-dimensional model 
of G made out of very thin strong wire. 

The following theorem summarizes what we have just learned. 

Theorem 5. 1 Let A cR 2 be a nonempty pathwise-connected compact set, such 
that the following conditions hold. Let N be an integer greater than 1 . Let there be 
a hyperbolic IFS {R 2 ; M n , n = 1, 2, . . . , TV} such that A is the attractor of the IFS. 
Let there be a set of distinct points {(F, , G,) e A : i — 0, 1,2, ... ,N} such that 

M n (F 0 , H 0 ) = (F„_i, H n -{) and w n (F N , H N ) = (F n , H n ) forn = 1, 2, . . . , N. 

Then there is a continuous function f : [0, 1] -> R 2 such that /([0, 1]) = A. One 
such function is the one whose graph is the attractor of the IFS 

{R 2 ; w n (x, y, z) = (^x + ^-i, M n (y, z)), n = 1, 2, . . . , N}. 

Examples & Exercises 

5.1. Let A denote the Sierpinski triangle with vertices at the points (0, 0), (0, 1), 
and (1, 0). Find an IFS of the form {R 3 ; w\, w 2 , W3}, where the maps are affine, such 
that the attractor of the IFS is the graph of a continous function / : [0, 1] — ► R 2 such 
that / ([0, 1]) = A. Four projections of such an attractor are shown in Figure VI. 187. 

5.2. Find an IFS {R 3 ; w\, w 2 , W3, w 4 }, where the transformations are affine, whose 
attractor is the graph of a continuous function / : [0, 1] — ► R 2 such that /([0, 1]) = 
A, where A is the set represented in Figure VI. 188. 
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Figure VI. 185. Vari- 
ous views of the attractor 
of a certain IFS. From 
some points of view we 
see that it is the graph of a 
function. From one point 
of view it is the graph of a 
space-filling curve! 



Figure VI. 186. Higher- 
resolution view of the 
lower right panel of Fig- 
ure VI. 185. 



5 Space-Filling Curves 245 




Figure VI. 187. Four 
views of the attractor of 
an IFS. This attractor is 
the graph of a continuous 
function / : [0, 1] -> R 2 
such that /([0, 1]) is 
a Sierpinski triangle. 
This function provides 
a "space-filling" curve, 
where space is a fractal! 




Figure VI. 188. Find an 

IFS {[R 3 ; w u u> 2 , u> 3 , w 4 ), 
where the transforma- 
tions are affine, whose 
attractor is the graph of 
a continuous function 
/ : [0, 1] -> U 2 such that 
/([0, 1]) = A, where A is 
the set represented here. 



Chapter VII 

Julia Sets 



1 The Escape Time Algorithm for Computing Pictures of IFS Attractors 
and Julia Sets 

Let us consider the dynamical system {IR 2 ; /}, where / : IR 2 -» IR 2 is defined by 

(2x,2y-l) ify>0.5, 
f(x,y)=- (2x-l,y) if x > 0.5 and y < 0.5, 
(2x,2y) otherwise. 

This dynamical system is related to the IFS {R 2 ; w\, w 2 , w 3 }, where 
Wl (x, y) = (0.5x,0.5y + 0.5), 
w 2 (x,y) = (0.5x + 0.5,0.5y), 
w 3 (x,y) = (0.5x,0.5y)}. 
The attractor of the IFS is a Sierpinski triangle A with vertices at (0,0), (0, 1), 
and (1,0). The relationship between the dynamical system {IR 2 ; /} and the IFS 
{1R 2 ; w\, w 2 , W3} is that {A; /} is a shift dynamical system associated with the IFS. 
(Shift dynamical systems are discussed in Chapter IV, section 4.) One readily verifies 
that / restricted to A satisfies 

.-1, 



f(x,y) = 



w;\x, y) if (x, y) e w^A) \ {(0, 0.5), (0.5, 0.5)}, 



w^(x, y) if (x, y) e w 2 (A) \ {(0.5, 0)}, 
^w^(x,y) if (x, y) € 103(A). 

In particular, / maps A onto itself. The dynamical system {1R 2 ; /} is an exten- 
sion of the shift dynamical system {A; /} to IR 2 . The situation is illustrated in Fig- 
ure VII. 189. 

Let d denote the Euclidean metric on IR 2 . The shift dynamical system {IR 2 ; /} 
is "expanding": for any pair of points x u x 2 lying in any one of the three domains 
associated with /, we have 

d(f(x l ),f(x 2 )) = 2d(x l ,x 2 ), 
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In this region f(x,y)=w/2 (x,y) 



One can prove that the orbit {/ OM (*)}£L 0 diverges toward infinity if x does not 
belong to A. That is 

d(0, f on (x)) oo as n oo for any point xel 2 \A. 

What happens if we compute numerically the orbit of a point x e A? Recall that 
the fractal dimension of A is log(3)/ log(2). This tells us that A is "thin" compared 
to R 2 . Hence, although /(A) = A, errors in a computed orbit are likely to produce 
points that do not lie on A. This means that, in practice, most numerically computed 
orbits will diverge, regardless of whether or not the initial point lies on A. The 
Sierpinski triangle A is an "unstable" invariant set for the transformation / : R 2 -+ 
R 2 . It is a "repulsive" |ixed point for the transformation / : H(R 2 ) H(R 2 ). It is 
an attractive fixed point for the transformation W : H(R 2 ) H(R 2 ), where W = 
wi U w 2 U 103 is defined in the usual manner. 



Figure VII. 189. The 

dynamical system {0& 2 , /} 
is obtained by extending 
the definition of a shift 
dynamical system on a 
Sierpinski triangle to all 
of 
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Figure VII. 190. How 

long do orbits of points in 
W take to arrive in V? We 
expect that the number of 
iterations required should 
tell us something about the 
structure of A. 




Intuitively, we expect that orbits of the dynamical system {R 2 ; /} that start close 
to A should "take longer to diverge" than those that start far from A. How fast do 
different orbits diverge? Here we describe a numerical, computergraphical experi- 
ment to compare the number of iterations required for the orbits of different points 
to escape from a ball of large radius, centered at the origin. Let (a, b) and (c, d), re- 
spectively, denote the coordinates of the lower left corner and the upper right corner 
of a closed, filled rectangle W C R 2 . Let M denote a positive integer, and define an 
array of points in W by 

(c — a) (d — b) 

x p , q = (a + p — — — , b + q — — — ) for p, q = 0, 1, 2, . . . , M. 
M M 

In the experiment these points will be represented by pixels on a computer graphics 
display device. We compare the orbits {f on (x p<q ) :}^ 0 for p, q = 0, 1, 2, . . . , M. 

Let R be a positive number, sufficiently large that the ball with center at the origin 
and radius R contains both A and W. Define 

V = {(x, y) e R : x 2 + y 2 > R). 

A possible choice for the rectangle W and the set V, in relation to A, is illustrated 
in Figure VII. 190. In order that the comparison of orbits provides information about 
A, one should choose W so that W n A / 0. 

Let numits denote a positive integer. The following program computes a finite set 
of points 

{f ol (x P , q ), f o2 (x Ptg ), r\x Ptq ), . . . , r n {x p , q )} 

belonging to the orbit of x p , q e W, for each p, q = 1, 2, . . . , M. The total number 
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of points computed on an orbit is at most numits. If the set of computed points of the 
orbit of x p<q does not include a point in V when n =numits, then the computation 
passes to the next value of (p, q). Otherwise the pixel corresponding to x PA is ren- 
dered in a color indexed by the first integer n such that f on (x Pi(j ) e V, and then the 
computation passes to the next value of (p, q). This provides a computergraphical 
method for comparing how long the orbits of different points in W take to reach V. 

The program is written in BASIC. It runs without modification on an IBM PC 
with Enhanced Graphics Adaptor and Turbobasic. On any line the words preceded 
by a ' are comments: they are not part of the program. 

Program 1 . ((Example of the Escape Time Algorithm)) 
numits=20: a=0 : b=0: c=l : d=l : M=100 'Define viewing 

window, W, and numits. 

R=200 'Define the region V. 
screen 9: els 'Initialize graphics, 
for p=l to M 
for q=l to M 

x = a + (c-a)*p/M : y = b + (d-b)*q/M 'Specify the initial 



for n=l to numits 'Compute at most numits points on the orbit 



point of an orbit, x(p,q). 



of x(p,q) . 



if y > 0.5 then 



Evaluate $f$ applied to the 
previous point on the orbit . 



x = 2*x : y = 2*y - 1 



elseif x > 0.5 then 



x = 2*x - 1 



: y = 2*y 



'THE FORMULA FOR THE 
FUNCTION f(x) 



else 



x = 2*x : y = 2*y 



end if 



150 if x*x + y*y > R then 'If the most recently computed 

point lies in V then. . . 
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160 pset(p,q),n : n = numits '...render the pixel x(p,q) 

in color n, and go to the next 

(p,q) • 
170 end if 

if instat then end 'Stop computing if any key is pressed! 

next n : next q : next p 

end 

Color Plate 6 shows the result of running a version of Program 1 on a Masscomp 
5600 workstation with Aurora graphics. 

In Figure VII. 191 we show the result of running a version of Program 1, but 
this time in black and white. A point is plotted in black if the number of iterations 
required to reach V is an odd integer, or if the orbit of the point does not reach V 
during the first numits iterations. 

In Figure VII. 192 we show the result of running a version of Program 1, with 
(a, b) = (0, 0), (c, d) = (5 x 10" 18 , 5 x 10~ 18 ), and numits = 65. This viewing win- 
dow is minute. See also Color Plate 7. Now you should be convinced that A is not 
simplified by magnification. 

The dynamical system {R 2 ; /} contains deep information about the "repelling" 
set A. Some of this information is revealed by means of the Escape Time Algorithm. 
The orbits of points that lie close to A do indeed appear to take longer to escape from 
R 2 \ V than those of points which lie further away. 

Examples & Exercises 

1.1. Let {R 2 , /} denote the dynamical system defined at the start of this chapter 
and let A denote the associated Sierpinski triangle. Prove that the orbit {f on (x)}™ =0 
diverges, for each jc e R 2 \ A. That is, prove that d(0, f on (x)) oo as n oo for 
each* e R 2 \ A. 

1 .2. Rewrite Program 1 in a form suitable for your own computergraphical envi- 
ronment, then run it and obtain hardcopy of the output. 

1 .3. If the Escape Time Algorithm is applied to the dynamical system 

{R 2 ; f(x,y) = (2x,2y)}, 
what will be the general appearance of resulting colored regions? 

1.4. By changing the window size in Program 1, obtain images of "zooms" 
on the Sierpinski triangle. For example, use the following windows: (0, 0) - 
(0.5,0.5); (0,0) - (0.25,0.25); (0,0) - (0.125, 0.125); .. .. How must the total 
number of iterations, numits, be adjusted as a function of window size in order 
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Figure VII. 191. Out- 
put from a modified ver- 
sion of Program 1. A pixel 
is rendered in black if 
either the number of it- 
erations required to reach 

V is an odd integer, or 
the orbit does not reach 

V during the first numits 
iterations. 



Figure VII. 192. Here 
we show the result 
of running a version 
of Program 1, with 
(a, b) = (0,0), (c,d) = 
(5 x 10" 18 , 5 x lO' 18 ), 
and numits — 65. This 
viewing window is 
minute, yet the com- . 
putation time was not 
significantly increased. If 
we did not know it before, 
we are now convinced 
that A is not simplified by 
magnification. 
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that (approximately) the quality of the images remains uniform? Make a graph of 
the total number of iterations against the window size. Is there a possible relation- 
ship between the behavior of numits as a function of window size, and the fractal 
dimension of the Sierpinski triangle? Make a hypothesis and test it experimentally. 

Here we construct another example of a dynamical system whose orbits "try to 
escape" from the attractor of an IFS. This time we treat an IFS whose attractor has a 
nonempty interior. Consider the hyperbolic IFS {M. 2 ; w\, W2}, where 
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The attractor of this IFS is a closed, filled rectangle, which we denote here by 
■. This attractor is the union of two copies of itself, each scaled by a factor 1 /-s/2, 
rotated about the origin anticlockwise through 90°, and then translated horizontally, 
one copy to the left and one to the right. The inverse transformations are 
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, and 



Define / : R 2 -» [R 2 by 



f(x,y) = 



w x l (x, y) 
Wy\x, y) 



when x > 0 
when x <0. 



Then the dynamical system {[R 2 ; /} is an extension of the shift dynamical system 
{■; /} to R 2 . 

What happens when we apply the Escape Time Algorithm to this dynamical 
system? To see, one can replace the function f(x) in Program 1 by 
if x > 0 then 

newx = s*y : newy = -s*x + s 
else 



newx = s*y : newy = -s*x - s 
end if 



'THE FORMULA FOR THE 
FUNCTION f(x) 



x = newx : y = newy 
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Figure VII. 193. An 

image of an IFS attrac- 
tor computed using the 
Escape Time Algorithm. 
This time the attractor of 
the IFS is a filled rectangle 
and the computed orbits of 
points in ■ seem never to 
escape. 



Results of running Program 1, thus modified, with the window W and the escape 
region V chosen appropriately, are shown in Figure VII. 193. 

It appears that the orbits of points in the interior ■ do not escape. This is not 
suprising. The fractal dimension of the attractor of the IFS is the same as the fractal 
dimension of R 2 , so small computational errors are unlikely to knock the orbit off 
the invariant set. It also appears that the orbits of points that lie in R 2 \ m reach V 
after fewer and fewer iterations, the farther away from ■ they start. 

Again we see that the Escape Time Algorithm provides a means for the compu- 
tation of the attractor of an IFS. Indeed, we have here the bare bones of a new algo- 
rithm for computing images of the attractors of some hyperbolic IFS on R 2 . Here are 
the main steps, (a) Find a dynamical system {R 2 ; /} which is an extension of a shift 
dynamical system associated with the IFS, and which tends to transform points off 
the attractor of the IFS to new points that are farther away from the attractor. (This 
is always possible if the IFS is totally disconnected. The tricky part is to find a for- 
mula for f(x), one which can be input conveniently into a computer. In the case of 
affine transformations in R 2 , one can often define the extensions of the domains of 
the inverse transformations with the aid of straight lines.) (b) Apply the Escape Time 
Algorithm, with V and W chosen appropriately, but plot only those points whose nu- 
merical orbits require sufficiently many iterations before they reach V. For example, 
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Figure VII. 194. Im- 
ages of an IFS attractor 
computed using the Es- 
cape Time Algorithm. 
Only points whose orbits 
have not escaped from 
(R \ V after numits iter- 
ations are plotted. The 
value for numits must be 
chosen not too large, as in 
(a); and not too small, as 
in (b); but just right, as in 
(c). 




in Program 1 as it stands, one can replace the three lines 150, 160, and 170 by the 
two lines 

150 if n = numits then pset(p,q),l 

160 if x*x + y*y > R then n = numits 
and define numits = 10. If the value of numits is too high, then very few points will 
not escape from W and a poor image of A will result. If the value of numits is too 
low, then a coarse image of the A will be produced. An image of an IFS attractor 
computed using the Escape Time Algorithm, modified as described here, is shown 
in Figure VII. 194. 

Color Plates 8-12 show the results of applying the Escape Time Algorithm to the 
dynamical system associated with various hyperbolic IFS in R 2 . In each case the 
maps are affine, and the shift dynamical system associated with the IFS has been 
extended to R 2 . 
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Examples & Exercises 

1.5. Modify your version of Program 1 to compute images of the attractor of the 
IFS {C; wi(z) = re w z - 1, tu 2 (z) = re i0 z + 1}, when r = 1/V2 and 0 = 7t/2. 

1.6. Show that it is possible to define a dynamical system {C; /} which extends to 
C the shift dynamical system associated with the IFS 

{C; wdz) = re ie z - 1, w 2 (z) = re w z + 1}, 
for any 0 G [0, 2n), provided that the positive real number r is chosen sufficiently 
small. Note that this can be done in such a way that / is continuous. 

1 .7. Let {A; /} denote the shift dynamical system associated with a totally discon- 
nected hyperbolic IFS in IR 2 . A denotes the attractor of the IFS. Show that there are 
many ways to define a dynamical system {IR 2 ; g} so that /(*) = g(x) for all x e A. 

The Escape Time Algorithm can be applied, often with interesting results, to any 
dynamical system of the form {IR 2 ; /}, {C; /}, or {C; /}. One needs only to specify 
a viewing window W and a region V, to which orbits of points in W might escape. 
The result will be a "picture" of W wherein the pixel corresponding to the point z is 
colored according to the smallest value of the positive integer n such that f on (z) e V. 
A special color, such as black, may be reserved to represent points whose orbits do 
not reach V before (numits +1) iterations. 

What would happen if the Escape Time Algorithm were applied to the dynamical 
system f : C C defined by f(z) = z 2 ? This transformation can be expressed 
f(x, y) = (jc 2 - y 2 , 2xy). From the discussion of the quadratic transformation in 
Chapter III, section 4, we know that the orbits of points in the complement of the 
unit disk F = {z e C : \z\ < 1} converge to the point at infinity. Orbits of points in 
the interior of F converge to the origin. So if W is a rectangle that contains F 
and if the radius R, which defines V, is sufficiently large, then we expect that the 
Escape Time Algorithm would yield pictures of F surrounded by concentric rings of 
different colors. The reader should verify this! 

F is called the filled Julia set associated with the polynomial transformation 
f( z ) = z 2 . The boundary of F is called the Julia set of /, and we denote it by 
J. It consists of the circle of radius 1 centered at the origin. One can think of J 
on the Riemann Sphere as being represented by the equator on a globe. This Julia 
set separates those points whose orbits converge to the point at Infinity from those 
whose orbits converge to the origin. Orbits of points on J itself cannot escape, 
either to infinity or to the origin. In fact J e H(C) and /(/) = J — f~ l (J)- It is 
an "unstable" fixed point for the transformation / : H{C) 7i(C). 

Definition 1.1 Let f :C C denote a polynomial of degree greater than 1 . 

Let F f denote the set of points in C whose orbits do not converge to the point at 
i 

infinity. That is, 

F f = {zeC: {\f on (z)\}Zo is bounded }. 
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Figure VII. 195. Illus- 
tration showing what is 
going on in the proof 
of Theorem 1.1. This il- 
lustrates the increasing 
sequence of sets {V„} of 
the point at infinity. It also 
shows the decreasing se- 
quence of sets, K n , the 
complements of the lat- 
ter, which converge to the 
filled Julia set F f . In gen- 
eral the origin, O, need 
not belong to Ff. 



This set is called the filled Julia set associated with the polynomial f. The boundary 
of Ff is called the Julia set of the polynomial f, and it is denoted by Jf. 

Theorem 1.1 Let f : C C denote a polynomial of degree greater than 1. 
Let Ff denote the filled Julia set of f and let Jf denote the Julia set of f. Then F f 
and Jf are nonempty compact subsets of C; that is, Ff e H(C) and Jf e H(C). 
Moreover, f(J f ) = Jf = f~ l (Jf) and f(F f ) = F f = f-\F f ). The set V 00 = C\ 
Ff is pathwise-connected. 

Proof We outline the proof for the one-parameter family of transformations f x : 
C -* C defined by 

f x {z) = z 2 - k, where X e C is the parameter. 

The general case is treated in [Blanchard 1984], [Brolin], [Fatou 1919-20], and 
[Julia 1918], for example. This outline proof is constructed to provide information 
about the relationship between the theorem and the Escape Time Algorithm. Some 
of the ideas and notation used here are illustrated in Figure VII. 195. 

Let J x denote the Julia set for fx and let F x denote the filled Julia set for f x . Let d 
denote the Euclidean metric on C and let 

R > 0.5 + 70.25 + \k\. 

Then it is readily verified that 

d(0, f(z)) > d(0, z) for all z such that d(0, z) > R. 

Define 

V = {zeC:|z|>fl}U{oo}. 



Ball around oO 
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Then it follows that 

/(V) C V. 

One can prove that the orbit {f on (z)} converges to oo for all z e V. No bounded orbit 
intersects V. It follows that 

F x = {z € C : f on (z) & V for each finite positive integer n). 

That is, Fx is the same as the set of points whose orbits do not intersect V. 
Now consider the sequence of sets 

V n = f°- n (V) forn=0, 1,2,.... 

For each nonnegative integer n, V n is an open connected subset of (C, Spherical). 
V„ is open because V is open and / is continuous. V n is connected because of the 
geometry of the quadratic transformation, described in Chapter III, section 4: The 
inverse image of a path that joins the point at infinity to any other point on the sphere 
is a path that contains the point at infinity. 
Since /(V) C V it follows that V C / _1 (V). This implies that 

V = V 0 C Vi C V 2 C V 3 C • • • C V„ C • • • . (1) 

For each nonnegative integer n, 

V n = [zeC: {z, f ol (z), f°\z), f°\z), ■ ■ ■ , f on (z)} n V # 0} . 

That is, V n is the set of points whose orbits require at most n iterations to reach V. 
Let 

K n = C\V n forn=0, 1,2, 3, .... 

Then K n is the set of points whose orbits do not intersect V during the first n 
iterations. That is, 

K n = [z e C : {z, f ol (z), f°\z), f°\z), f on (z)} nV = &]. 

For each nonnegative integer n, K n is a nonempty compact subset of the metric space 
(C, Spherical). How do we know that K„ is nonempty? Because we can calculate that 
/ possesses a fixed point z/ e C, by solving the equation 

f(zf) = z 2 f -X. = Zf. 

The orbit of z/ converges to z/. Hence it cannot belong to V n for any nonnegative 
integer n. Hence Zf e K n for each nonnegative integer n. 
Equation 1 implies that 

K 0 1D KiD K 2 D K 3 D • • O K n D • • • . 
It follows that {K„} is a Cauchy sequence in H(C). It follows that {K n } converges to 
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a point in H(C). The limit is the set of points whose orbits do not intersect V. Hence 

F f = lim K n = n™ =0 K n , 

and we deduce that F f belongs to H(C). 
The equation 

K n+l = f°-\K n ) for* =0,1,2,... 
now implies, as in the proof of Theorem 4.1, that 

F x = f 0 ~ l (F x ). 
Applying / to both sides of this equation, we obtain 

f(Fx) = Fx- 

Let us now consider the boundary of F x , namely the Julia set J x for the dynamical 
system {C; /*}. Let z e interior(F x ). Then the continuity of / implies f~ l (z) C 
interior(Fx). Hence F x D f~ l (dF k ) D ^F x . Now suppose that z e f~ l (dF k ). Let O 
be any open ball that contains z. Since / is analytic, f(0) is an open ball, and it 
contains f(z)edF x . Hence f(0) contains a point whose orbit converges to the 
point at infinity. It follows that O contains a point whose orbit converges to the 
point at infinity. Thus f~ l (dF x ) C 3F X . We conclude that r\dF x ) = dF x and in 
particular that f(dF x ) = dF x . This completes the proof of the theorem. 

We summarize some of what we discovered in the course of this proof. The 
filled Julia set F x is the limit of a decreasing sequence of compact sets. Its comple- 
ment, which we denote by Voo, is the limit of an increasing sequence {V„} of open 
pathwise-connected sets in (C, spherical). That is, 

Voo= lim V B = U~ 0 V B . 

The latter is called the basin of attraction of the point at infinity under the polyno- 
mial tranformation /*. It is connected because each of the sets V„ is connected. We 
have 

C = F x U Voo. 

Voo is open, connected, and nonempty. F x is compact and nonempty. 

The Escape Time Algorithm provides us with a means for "seeing" the filled Julia 
sets F x , as well as the sequences of sets {V„} and {K n } referred to in the theorem. 
Let us look at what happens in the case X = 1.1. Define V by choosing R = 4, and 
let W = {(x, y) : -2 < jc < 2, -2 < y < 2}. The function / x =i.i : C C is given 
by the formula 

A=i.i(x,y) = (x 2 -y 2 -l.l,2xy) for all (x, y) e C. 

An example of the result of running the Escape Time Algorithm, with V, W and 
f:C-*C thus defined, is shown in Figure VII. 196. The black object represents 
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the filled Julia set Fx=\.\. The contours separate the regions V n +\ \ V„, for some 
sucessive values of n. These contours also represent the boundaries of the regions 
K n referred to in the proof of the theorem. We refer to them as escape time contours. 
Points in V n +i \V n have orbits that reach V in exactly (n + 1) iterations. In Color 
Plate 13 we show another example of running the Escape Time Algorithm to produce 
an image of the same set. The regions V n+ \ \ V n are represented by different colors. 

Figure VII. 197 shows a zoom on an interesting piece of Fx=\.\, including parts of 
some escape time contours. This image was computed by choosing W to be a small 
rectangular subset of the window used in Figure VII. 196. 

Figures VII.198(a)-(e) shows pictures of the filled Julia sets Fx for a set of real 
values of k. These pictures also include a number of the escape time contours, to 
help indicate the location of Fx. Fq is a filled disk. As k increases, the set becomes 
more and more pinched together until, when k = 2, it is the closed interval [—2, 2]. 
For some values of k e [0, 2], it appears that Fx has no interior, and is "tree-like"; for 
other values it seems to possess a roomy interior. It also appears that Fx is connected 
for all k e [0, 2], and totally disconnected when k > 2. In the latter case Fx may be 
described as a "Cantor-like" set, or as a "dust." The transition between the totally 
disconnected set and the connected, bubbly set as the parameter k is varied reminds 
us of the transition between the the Cantor set and the Sierpinski triangle, discussed 
in connection with Figure IV. 1 18. 

Examples & Exercises 

1.8. Modify your version of the Escape Time Algorithm to allow you to com- 
pute pictures of filled Julia sets for the family of quadratic polynomials z 2 — k for 

Figure VII. 196. The 

Escape Time Algorithm 
provides us with a means 
for "seeing" the filled 
Julia sets F x , as well 
as the sequences of sets 
{V„} and {K n } referred 
to in Theorem 1. In this 
illustration, k = 1.1. The 
black object represents 
the filled Julia set F k=Li . 
The contours separate the 
regions V„_i \ V„, for 
some successive values 
of n. These contours also 
represent the boundaries 
of the regions K n referred 
to in the proof of the 
theorem. 
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Figure VII.197. Zoom 
in on an interesting piece 
of Figure VII. 196. 
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Figure VII. 198. (a)-(e) 
A sequence of Julia set im- 
ages, as in Figure VII. 196, 
for an increasing sequence 
of values k in the range 
0 to 3. In (d) and (e) the 
filled Julia set is the same 
as the Julia set: the filled 
Julia set has no interior, so 
it equals its boundary. In 
(d) the Julia set is "tree- 
like." In (e) the Julia set is 
totally disconnected. 



Figure VII. 198. (b) 
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windows W, which intersect the apparent boundary of the filled Julia set, and to 
seek the window size at which the quality of computed images seems to deteriorate. 
(You will need to increase the maximum number of iterations, M, as you zoom.) 
Can you give evidence to show that the apparently deteriorated images are not, in 
fact, correct? 

1.13. Figure VII. 1 99 was computed by applying the Escape Time Algorithm to the 
dynamical system {C; f(z) = z 4 - z - 0.78}. The viewing window is W = {(*, y) : 
-l<jc<l,-l<y<l}. Determine the escape region V. Also, you might like to 
try magnifying one of the little faces in this image. 

1.14. The images in Figure VII.200 (a),(b),(c), and (d) represent the nontrivially 
distinct attractors of all IFS of the form 

{■; W], w 2 , w 3 }, 

where the maps are similitudes of scaling factor one-half, and rotation angles in the 
set {0°, 90°, 180°, 270°}. The "three translations (0, 0), (1, 0), and (0, 1) are used. 
These IFS are all just-touching. For / / j the set Wi(A) n Wj(A) is contained in one 
of the two straight lines x = 1 or y = 1 . Show that, as a result, it is easy to compute 
these images using the Escape Time Algorithm. 

Here are some observations about this "group" of images. Many of them contain 



Figure VII. 199. This 
image was computed 
by applying the Es- 
cape Time Algorithm 
to the dynamical sys- 
tem {C; f(z) = z 4 - z - 
0.78}. The viewing win- 
dow is W = {(x, y) : 
-1 < x < 1, -1 < y < 1] 
Can you determine the 
escape region V? 
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Figure VII.200. (a)- 

(d) The images in (a), 
(b), (c), and (d) represent 
the nontrivially distinct 
attractors of all IFS of 
the form {■; W\, w 2 , m), 
where the maps are simil- 
itudes of scaling factor 
one-half, and the rota- 
tion angles are in the 
set {0°,90°, 180°, 270°}. 
The three translations 
(0,0), (1,0), and (0, 1) 
are used. These IFS are all 
just-touching. For i ^ j 
the set Wi(A) n Wj(A) is 
contained in one of the 
two straight lines x — 1 or 
y = 1. Hence it is easy to 
compute images of these 
attractors using the Escape 
Time Algorithm. 



Figure VII.200. (b) 
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straight lines. They all have the same fractal dimension. They all use approximately 
the same amount of ink. Many of them are connected. Make some more observa- 
tions. Can you formalize and prove some of these observations? 

1.15. Verify computationally that a "snowflake" curve is a basin boundary for the 
dynamical system {IR 2 ; /}, where for all (x, y) e U 2 , 
f(x,y) = (0,-l)ify<0; 
f(x, y) = (3x, 3y) if y > 0 and x < -y/V3 + 1; 
f( x , y) = ((9 - 3x - 3V3y)/2, (3^3 - 3^3* + 3)/2) 

if y > 0 and - y/V3 + 1 < x < 3/2; 
f(x, y) = ((3x - 3V3y)/2, (3^3x + 3y- 6>/3)/2)/2), 

ify>0 and 3/2 < x < y/ V3 + 2; 
f{x, y) = {9-3x, 3y), if y > 0, and x > y/V3 + 2. 



2 Iterated Function Systems Whose Attractors Are Julia Sets 

In section 1 we learned how to define some IFS attractors and filled Julia sets with 
the aid of the Escape Time Algorithm applied to certain dynamical systems. In this 
section we explain how the Julia set of a quadratic transformation can be viewed as 
the attractor of a suitably defined IFS. 

The Escape Time Algorithm compares how fast different points in W escape to 
V, under the action of a dynamical system. Which set repels the orbits? From where 
do the escaping orbits originate? In the case of the dynamical systems considered at 
the start of section 1, orbits were "escaping from" the attractor of the IFS. 

Let X e C be fixed. Which set repels the orbits, in the case of the dynamical sys- 
tem {£; fx(z) = z 2 — M? To find out let us consider the inverse of fx(z). This is 
provided by a pair of functions, f~ l (z) = {+VF-FX, —y/z + X}, where, for exam- 
ple, the positive square root of a complex number is that complex root that lies on 
the nonnegative real axis or in the upper half plane. Explicitly, = y/x\ + ix 2 = 
(a(xi,x 2 ), b(x\,x 2 )) with 



a(x\, x 2 ) = 



a(x u x 2 ) = - 




2 + Xj + X\ 



when x 2 > 0, 




2 + X$ + X\ 



when x 2 < 0, 
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Figure VII.201. (a) 

and (b) The attractor 

o f the I FS {C; wi(z) = 

Vz + 1, Wliz) = -y/z + 1} 

is the Julia set for the 
transformation f(z) = 
z 2 — 1. (a) illustrates 
points whose orbits "es- 
cape" when the Escape 
Time Algorithm is ap- 
plied, (b) shows the results 
of applying the Random 
Iteration Algorithm to the 
IFS, superimposed on (a). 



To find the "repelling" set, we must try to run the dynamical system backwards. This 
leads us to study the IFS 

{C; wi(z) = VFTI, w 2 (z) = — V z + A.}. 

The natural idea is that this IFS has an attractor. This attractor is the set from which 
points try to flee, under the action of the dynamical system {C; z 2 - A}. 

A few computergraphical experiments quickly suggest a wonderful idea: they 
suggest that the the IFS indeed possesses an attractor, namely the Julia set J k = 
8F k for f x (z). Consider, for example, the case A = 1. Figure VII.201 (a) illustrates 
points in the window W = {z = (x, y) e C : -2 < x < 2, -2 < y < 2} whose orbits 
diverge. It was computed using the Escape Time Algorithm. Figure VII.201 (b) 
shows the results of applying the Random Iteration Algorithm to the above IFS, with 
A = 1 and the same screen coordinates, superimposed on (a). The boundary of the 
region Fx=i is outlined by points on the attractor of the IFS. 

Figures VII.202(a)-(d) show the results of applying the Random Iteration Algo- 
rithm to the IFS {C; wi(z) = Vz + A, w 2 (z) = — V z + A} for various A e [0, 3]. In 
all cases it appears that the IFS possesses an attractor, and this attractor is the Julia 
set Jx. 

Perhaps 

{C; wi(z) = Vz + A, w 2 (z) = -Vz + A} 

is a hyperbolic IFS with J k as its attractor? No, it is not, because C = u>i(Q U 
w 2 (t). The IFS is not 'associated with a unique fixed point in the space H(Q. In 
order to make the IFS have a unique attractor, we need to remove some pieces from 
C, to produce a smaller space on which the IFS acts. 
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Figure VII.201. (b) 



■ ■ ■ ■'V—'' \. 



Theorem 2. 1 Let X e C Suppose that the dynamical system {<C; f(z) — z 2 — A} 
possesses an attractive cycle {z\, zi, Z3, ■■ ■ , z p ] C C. Let e be a very small positive 
number. Let X denote the Riemann Sphere C with (p + 1) e>/?erc of radius e 
removed. (The radius is measured using the spherical metric.) One ball is centered at 
each point of the cycle, and one ball is centered at the point at infinity, as illustrated 
in Figure VII.203. Define an IFS by 



{X; w\(z) = Vz + A, w 2 (z) 
Then the transformation W on7i(X), defined by 



W(B) = w x (B) U w 2 {B) for all B e H(X), 

maps H(X) into itself, continuously with respect to the Hausdorff metric on H(X). 
Moreover W : H(X) — ► H(X) possesses a unique fixed point, J x , the Julia set for 



A. Also 



lim W on (B) = J x for all B e H(X). 



These conclusions also hold if the orbit of the origin, {f on (0)}, converges to the 
point at infinity, and X = C \ B(oo, e). 

Sketch Of proof The fact that W takes H(X) continuously into itself follows 
from Theorem 4.1. To apply Theorem 4.1, three conditions must be met. These 
conditions are (i), (ii) and (iii), stated next. / is analytic on <C so (i) it is continuous, 
and (ii) it maps open sets to open sets. The way in which X is constructed ensures 
that, for small enough e, (iii) /(X) D X. (The latter implies W(X) = f~ l (X) c X.) 

To prove that W possesses a unique fixed point we again make use of Theo- 
rem 4.1. Consider the limit A e H(X) of the decreasing sequence of sets, { W on (X)}, 
namely, 

A = n% l f o( - n) (X)= lim W on (X). 
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Figure VII.202. (a)- 

(d) The results of apply- 
ing the Random Itera- 
tion Algorithm to the IFS 

{£■ VFTX, -VFTI} 

for various values of 
X e [0, 3]. Compare these 
images with those in Fig- 
ure VII. 198. The results 
are pictures of the Julia 
set for f k (z) = z 2 -X.The 
connection between these 
Julia sets and IFS theory is 
revealed! 



Figure VII.202. (b) 




Figure VII.202. (c) 
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Figure VII. 202. (d) 




Figure VII.203. The 

Riemann Sphere C with a 
number of very small open 
balls of radius e removed. 
One ball is centered at 
each of the points {z p e Q 
belonging to an attractive 
cycle of the transformation 
f k ( z ) = z 2 -k. One ball 
is centered at the point at 
infinity. 



Infinity 




This obeys W(A) = A. It follows from [Brolin 65], Lemma 6.3, that A = J k , the 
Julia set. This completes the sketch of the proof. 

Theorem 2.1 can be generalized to apply to polynomial tranformations / : 
C — ► C of degree N greater than 1. Here is a rough description: let f~ l (z) = 
{w\(z), W2(z), . . . , %(z)} denote a definition of branches of the inverse of /. Then 
consider the IFS {€; w\(z), W2(z), ■ . . , w N (z)}. This IFS is not hyperbolic: the "typ- 
ical" situation is that the associated operator W : H(C) — > H(C) possesses a finite 
number of fixed points, all except one of which are "unstable." The one "stable" 
fixed point is 7/, and W on (A) -> 7/ for "most" A e H(C). In principle, 7/ can be 
computed using the Random Iteration Algorithm. 
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Results like Theorem 2.1 are concerned with what are known as hyperbolic Julia 
sets. The Julia set of a rational transformation / : £ -+ £ is hyperbolic if, whenever 
c e £ is a critical point of /, the orbit of c converges to an attractive cycle of /. 
The Julia set for z 2 - 0.75 is an example of a nonhyperbolic Julia set. We refer to 
[Peitgen 1986] as a good source of further information about Julia sets from the 
dynamical systems' point of view. 

Explicit formulas for the inverse maps, {w n (z) : n = 1, 2, . . . , N], for a polyno- 
mial of degree N, are not generally available. So the Random Iteration Algorithm 
cannot usually be applied. Pictures of Julia sets and filled Julia sets are often com- 
puted with the aid of the Escape Time Algorithm. The case of quadratic transfor- 
mations is somewhat special, because both algorithms can be used. The Random 
Iteration Algorithm can also be applied to compute Julia sets of cubic and quar- 
tic polynomials, and of special polynomials of higher degree such as z n + X where 
n = 5, 6, 7, . . ., and X e C 

Examples & Exercises 

2.1. Consider the dynamical system {£; f( z )=z 2 }. The origin, O, is an at- 
tractive cycle of period 1: indeed f(O) = O and \f'(0)\ = 0 < 1. Notice that 
linwoo f on {z) = O for all z e B(0, 0.99999999). Let B (z, r) denote the open ball 
on £, with center at z and radius r. Theorem 2.1 tells us that the IFS 

{X = £ \ {B (O, 0.0000001)U B (oo, 0.0000001)}; w x {z) = ^z~,w 2 (z) = -Vz} 

possesses a unique attractor. The attractor is actually the circle of radius 1 centered at 
the origin. It can be computed by means of the Random Iteration Algorithm. Notice 
that if we extend the space X to include O, then O e H(X) and O = W(O) = 
w x {0) U w 2 (0). If we extend X to include B (O, 0.0000001), then the filled Julia 
set F 0 belongs to H(X) and obeys F 0 = W(F 0 ). If we take X to be all of £ then 
£=W(£). In other words, if the space on which the IFS acts is too large then 
uniqueness of the "attractor" of the IFS is lost. 

Can you find two more nonempty compact subsets of C that are fixed points of 

W, in the case X = C? 

Establish that, for all X e (-0.25, 0.75), the point z 0 = 0.5 - V0.25 + X is an 
attractive cycle of period 1 for {£; z 2 - X}. Deduce that the corresponding IFS, 
acting on a suitably chosen space X, possesses a unique attractor. 

2.2. Let X e (0.75, 1.25). Consider the dynamical system {<C; f(z) = z 2 - X}. Let 
zu Z2 e R denote the two solutions of the equation z 2 + z + (1 - X) = 0. Show 
f(zi) = z 2 , f(z 2 ) = zu\(f o2 Y(zi)\ = l(/ o2 )'(z 2 )l < 1 and hence that {zx,z 2 } is an 
attractive cycle of period 2. Deduce that the IFS 
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{(C \ {B (zi, €)U B (Z2, €)U B (oo, €)}; Wz+A, -Vz + A} 
possesses a unique attractor when e is sufficiently small. 

2.3. The Julia set J\ for the polynomial z 2 — A is a union of two "copies" of 
itself. Identify these two copies for various values of A.. Explain how, when A. = 1, 
the two inverse maps tuj^Cz) and w^iz) rip the Julia set apart, and the set map 
W = w\ U W2 puts it back together again. Where is the rip? Describe the geometry 
of what is going on here. 

2.4. Consider the one-parameter family of polynomials f(z) = z 3 — A, where A e 
C is the parameter. Give explicit formulas for the real and imaginary parts of three 
inverse functions u>i(z), w 2 (z), and wi(z) such that f~ l (z) = {w\(z), u>2(z), w 3 (z)} 
for all A e C Compute images of the filled Julia set for f(z) for A =0.01 and 
A = 1 . Compare these images with those obtained by applying the Random Iteration 
Algorithm to the IFS {C; w x (z), w 2 (z), w 3 (z)}. 

2.5. Consider the dynamical system {C; f(z) — z 2 — A} for A > 2. Show that 
{f on (0)} converges to the point at infinity. Deduce that the IFS 

{X = C \ B(oo, €); +Vz + A, -Vz + A} 

possesses a unique attractor A (A). A (A) is a generalized Cantor set. Compute some 
pictures of A (3). Use the Collage Theorem to help find a pair of affine transforma- 
tions Wi r : R — ► R, i — 1, 2, such that the attractor of the IFS {R; w\, W2} is an ap- 
proximation to A(3). Define / : R — >■ R by f{x) = iff when x < 0 and /CO = 
u>2"' (x) when x > 0. Compare the graphs of the functions f(x) = x 2 — 3 and /CO 
for, say, x e [—4, 4]. Compare one-dimensional "images" obtained by applying the 
Escape Time Algorithm in a similar manner to both {R; f) and {R; f}. 

One can sometimes obtain a hyperbolic IFS associated with a Julia set, if the do- 
mains and ranges of the inverse transformations are defined carefully. The following 
theorem provides such an example. 

Theorem 2.2 Let A e [—0.249, 0.749], and let € be a very small positive num- 
ber. Let a = 0.5 — V0.25 + A, an attractive fixed point of the dynamical system 
{t\ f{z) = z 2 - A}. Let X = C \ {B {a, e)U B (00, e) U (0, 00)}. That is, X consists 
of the Riemann Sphere with a small open ball centered at a, a small open ball cen- 
tered at 00, and the open interval (0, 00), removed. (This space is not compact be- 
cause the edges of the lips of the cut, from O to 00, are missing.) To each lip let there 
be attached copies of the pieces of the real interval (0, 00) which were removed, to 
provide a compact space X, as illustrated in Figure VII. 204. The distance d{z\, zi) 
between a pair of points z\ and zi € X is the length (measured using the spherical 
metric) of the shortest path that lies in X and connects z\ to zi- (Paths in X cannot 
cross the cut, they have to go around it.) (X, d) is a compact metric space. 

Define w\ : X — ► X by w\(z) = Vz + A, the root that lies in the "upper half 
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THE RIEMANN SPHERE 
AFTER SURGERY 



"UPPER HALF PLANE" 




Both edges of the 
cut are copies of [0,oO] 



//////////// 

Cut along the 
positive real axis 
with two lips. 




-Infinity 



Figure VII.204. Con- 
struction of a compact 
metric space X for IFS 
{X; VzTX, -Vz+J} 
with A € (-0.25, 0.75), 
used in Theorem 2.2. 
Both sides of the "slit" 
from —A to infinity belong 
to the space. The distance 
between a pair of points 
on X is the length of the 
shortest path that connects 
the points without cross- 
ing the slit. The distance 
between points may be 
much greater than it looks. 



"LOWER HALF PLANE" 



plane!' For z on the upper edge of the cut, w\(z) is also on the upper edge. For z 
on the lower edge of the cut, w\(z) lies on the negative real axis. Define w 2 : X — ► X 
by w 2 (z) = — y/z + A, the root that lies in the "lower half plane!' For z on the the 
upper edge of the cut, W2(z) lies on the negative real axis; and, for z located on the 
lower edge of the cut, W2(z) lies on the lower edge of the cut. 

Then there is a metric on X, equivalent to the metric d, such that the IFS 
{X; w\, w 2 } is hyperbolic. The attractor is the Julia set J x for z 2 — A, where the 
real point 0.5 + V0.25 + A is repeated on both the upper and the lower edges of the 
cut. 

Sketch Of proof Let e = (e u e 2 , . ■ ■ , e n , . . .) e S, the code space on the two 
symbols {1,2}. Define a sequence of nonempty compact subsets of X by 

X n (e) = w ei o w ei o w e} o w eii o w e5 o w e6 o • • • o w en (X) forn = 1 , 2, 3, 

It follows, using [Brolin], Theorem 6.2, and Lemma 6.3, that the sequence {X n e 
H(X)} converges to a singleton, say {(f>(e)}, where (f>(e) e Jx and 

U eei: 4>(e) = J x . 

A beautiful theorem of Elton [Elton 1988] applies under just these conditions and 
provides the conclusion ©f the theorem. This completes the outline of the proof. 

In those situations where the IFS {X; +Vz + A, - + M is hyperbolic one can 
use the associated code space to discuss both the Julia set and the associated shift 
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dynamical system {A; f(z) = z 2 - X}. Here we give some of the flavor of such a 
discussion. More details can be found in [Barnsley 1984]. 

For the remainder of this sect ion le t X e (-0.25, 0.75) and consider the IFS 
{X; wdz) = VzTX, w 2 (z) = -Vz + M, as defined in Theorem 2.2. Let £ de- 
note the code space on the two symbols {1,2}, and let 0 : £ -* A denote the 
associated code space map, introduced in Theorem 2.1 in Chapter IV. If e = 
(ei,e 2 ,...,e n ,...) € £,then 

ch(e) = lim w ei ° w e2 ° m>* 3 o w e4 o w e5 o w e6 o ■ ■ ■ o w 6n (z). 

Replace the symbol "1" by the symbol "+" and replace the symbol "2" by the 
symbol Then the point 0(e) on the Julia set A can be represented by the 
formula 



4>{e) = e\ 



N 



yXe 6 yl XejyJ 



Xe% . . . e n y/X. . ., 



■\| M 'N 

where c f e {+, -} for each positive integer /. The set J k itself can be represented by 



the collection of formulas 



X± 



x± 



>| X± N 



±}jx±ylx±y/, 



x±...±Vx 



(1) 



where all possible sequences of plus and minus signs are permitted. A particular 
sequence of signs, corresponding to a point in A, is an address of the point. 

In Figure VII.205 we show the Julia set for z 2 - 0.7, with the addresses of various 
points marked on it. Some points on J 0 .i have multiple addresses while others have 
single addresses. It appears that the the IFS is just-touching. 

The shift dynamical system associated with the IFS is {A; /(z) = z 2 - X}. Notice 
how the set of points represented by the formulas in equation 1 is mapped into itself 
by the function that "squares" a formula and subtracts X from the result. A point on 
a cycle of period 2 is represented by 



+ 



x + 



\ N 



x + 



y[x-yjx + j) 



+ *fx~ 



The other point on this cycle is obtained by squaring the formula and subtracting, k. 

In Theorem 2.4 in Chapter IV, we learned that the set of periodic points of the 
shift dynamical system associated with a hyperbolic IFS is dense in the attractor of 
the IFS. Here this tells us that the set of periodic points of the dynamical system 
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Figure VII.205. The Julia set for z 2 - 0.7, labelled with various addresses. Chaotic 
dynamics takes place on the Julia set and orderly dynamics takes place off it. Boundaries 
of a fractal character often separate regions where the dynamical system behaves differently. 
The behavior of the dynamical system on such a boundary may then be indecisive, and in 
some way chaotic. 



{J k ; f(z) = z 2 - k} is dense in J k . In fact, a related idea was the starting point of Ju- 
lia's original investigations. He considered dynamical systems of the form {€; f(z)}, 
where f(z) is analytic. He defined the (Julia) set to be the closure of the set of repul- 
sive cycles of /. 

Following Theorem 8.1 in Chapter IV we explained the sense in which the shift 
dynamical system associated with a hyperbolic IFS is chaotic. In the present context 
we learn that the dynamical system {J k ; z 2 — is chaotic. 

One can think of the dynamical system {€; z 2 - A.} as being the union of two 
dynamical systems, a chaotic one {J k ; z 2 - A} and an orderly one {€ \ J k \ z 2 - 
X}. The orbit of any point in the latter system converges to a fixed point of the 
transformation. The orbits of "most" points in the former system are wild. In practice 
they are usually so wild they cannot be constrained to remain on the repelling set J x . 
They escape and thereafter behave in a rather predictable manner. 

An example of chaotic dynamics on a Julia set is provided by the dynamical 
system {[0, 1]; /(*)■ = 4x(l - x)}. The interval [0, 1] is exactly the Julia set for the 
transformation. This system is close to the "chaotic" one illustrated in Figure IV. 108. 
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Examples & Exercises 

2.6. The Julia set for z 2 - 2 is the interval [-2, 2]. Show that the shift dynamical 
system associated with the IFS {[-2, 2]; +Vz + 2, - Vz +2} is precisely the dy- 
namical system {[—2, 2]; z 2 — 2}. Use a chain of square roots to locate a cycle of 
minimal period 3. 

2.7. Verify numerically that for various choices of ± on each square root, and 
for various complex numbers X such that is very small, the expression below 
evaluates approximately to a complex number that lies on the unit circle centered 
at the origin, if enough square roots are taken. (+ y/z means the solution, w, of the 
equation w 2 = z, which lies either on the nonnegative real axis or in the upper half 
plane.) Make a hand-waving explanation of why this is, in terms of Julia set theory. 



± 



^ \l N 



2.8. Design an IFS with condensation such that its attractor looks like an infinite 
nested chain of square root signs. 




N 

2.9. Figure VTI.206 represents a sequence of sets {A n } which converges to the 
Julia set of f(z) = z 2 — 1. A 0 denotes the union of the two largest faces and A n = 
f oi ~ a) (Ao). Identify the set A 2 . 



3 The Application of Julia Set Theory to Newton's Method 

We are familiar, since our first course in calculus, with Newton's method for com- 
puting solutions of the equation F(x) = 0. Or are we? 

Consider the polynomial F(z) = z A - 1 for z e C. There are four distinct com- 
plex numbers, a, ■ (i = 1, 2, 3, 4) such that F(a, ) = 0. These are called the roots, or 
the zeros, of the polynomial F(z). Newton's method provides a means to compute 
them. Pretend that we do not know that a\ — 1, a 2 — — 1, «3 = *\ and a 4 = Then 
Newton tells us to consider the dynamical system 

{C;/(Z) = Z "^(z) } - 
We call f(z) the Newton transformation associated with the function F(z). The 
general expectation is that a typical orbit {/°"(zo)h which starts from an initial 
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<o> 




Figure VII.206. Let 

/ : € -+ € denote the 
polynomial z 2 — 1. Let 
Aq be the union of the 
two largest smiley faces. 
Define a transforma- 
tion W : H(Q -+ H(Q 
by W(B) = f- l (B) for 
all B € H(C) and let 
A n = W on (A 0 ) for n = 
0, 1, 2, Then the se- 
quence of sets {A n } con- 
verges to the Julia set 
z 2 — 1. Can you identify 

At? 



"guess" zo e C, will converge to one of the roots of F(z). In the present example, 
the Newton transformation is given by 

tl a 324 + 1 

^> = -4i5- 

We expect the orbit of zo to converge to one of the numbers a\, a 2 , a^, or a 4 . If we 
choose zo close enough to a,-, then it is readily proved that 



lim / O "(zo)=««, 



for; = 1,2,3,4. 



If, on the other hand, zo is far away from all of the a, 's, then what happens? Perhaps 
the orbit of zo converges to the root of F(z) closest to zo? Or perhaps the orbit does 
not settle down, but wanders, hopelessly, forever? 

Let us make a computergraphical experiment to help answer these questions. 
We use the Escape Time Algorithm to produce a picture of those points zo eC 
whose orbits converge to a\. Define W = {(jc, y) e C : — 2 < x < 2, — 2 < y < 2} 
and V = {z e C : |z — a\ \ < 0.0001}. The real and imaginary parts of fix + iy) are 
given by 

' (ce + df) , . (de-cf) 

(e z + f z ) (e 2 + f 2 ) 
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where a = x 2 — y 2 , b = 2xy, c = 3a 2 — 3b 2 + 1, d = 6ab, e = 4(xa — yb), and / = 
4(xb + ya). Program 7.1.1 is modified accordingly. Pixels corresponding to points 
in W whose orbits reach V in less than a fixed number of iterations are plotted. 
A picture resulting from such an experiment is shown in Figure VII. 207. See also 
Figures VII.208 and VII.209. 

Color Plate 14 shows the output from another such experiment. This time Mer- 
cator's projection is used to represent the Riemann sphere, and points whose orbits 
converge to the different points a\, aj, a?,, and a 4 are plotted in different colors. 

The following definition is equivalent to Definition 1 . 1 in the case of polynomials, 
[Brolin]. 

Definition 3.1 The Julia set of a rational function / : C — ► C, of degree greater 
than 1 , is the closure of the set of repulsive periodic points of the dynamical system 
{C; /}. 

For the rational function f(z) considered above, one can prove that the Julia set J 
is the same as the set of points whose orbits do not converge to any one of the points 
a\, a2, aj, a 4 . In Figure VII. 207, J n W is represented by the boundary between 
the black and white regions. In Color Plate 14, J n W is the place where the four 
colors meet. The complement of the Julia set consists of four open sets, the basins 
of attraction of the four attractive fixed points of the Newton iteration scheme. In 
Color Plate 14 the dark green region represents part of the basin of attraction of a\. 
The black regions in the color plate are caused by (a) rounding errors, and (b) the fact 
that only 100 points on each orbit are tested, for convergence to one of the points a\, 
a2, a$, a 4 . 

The Julia set J is the part of C on which chaotic dynamics occurs. It can be 
characterized as the closure of the set of points whose orbits wander, hopelessly, 
forever. Orderly, slightly boring motion takes place on C \ J. J is the boundary of 
the white region. It is the boundary of the dark green region. It is a bonafide fractal, 
yet nobody knows its fractal dimension. 

There is a beautiful theorem of Sullivan, which can be illustrated using the 
"petals" in Color Plate 14. The complement of the Julia set is the union of a count- 
able collection of connected open sets, which we call petals. If P is a petal then 
f(P) is another petal. The Non-Wandering Domain Theorem [Sullivan 1982] says 
that no connected component of the complement of the Julia set wanders, hope- 
lessly, forever. It always settles into a periodic orbit of petals. If P is a petal in 
the present example, then one can prove that there is a positive integer S so that 

f oS (P) = /o(S+D(p) = yo(S+2) (p) = yo(S+3) (p) = . . . The fina j peta j yoS (p) j s 

one of the connected components of the complement of the Julia set that contains 
one of the points a\, aj, «3, a 4 . Each petal is eventually periodic. The orbit of a petal 
ends up in a cycle of petals of period 1 . 

How are we to think about this fabulous Julia set? IFS theory provides a simple 
point of view, as we show next. We begin by defining the inverse map associated 
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Figure VII.207. The 

Escape Time Algorithm 
is applied to analyze 
Newton's method for 
finding the complex roots 
of the polynomial z 4 - 1 . 
The boundary of this 
region represents the 
Julia set for the rational 
function /(z) = (3z 4 + 
l)/4z 3 . The points plotted 
black are those points z = 
x + iy with -2 < x < 2, 
and -2 < y < 2, whose 
orbits intersect V = [z € 
C: \z + 1| < 0.0001} in 
less than 1,000 iterations. 




Figure VII.208. The 

boundary of this region 
represents the Julia set 
for the rational function 
/(z) = (3z 4 +D/4z 3 .The 
two shades of gray, black 
and white, correspond to 
the basins of attraction of 
the four attractive fixed 
points of /(z). To which 
point in C do the orbits of 
points in the white region 
converge? 
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Figure VII.209. Mer- 
cator's projection of the 
Riemann sphere show- 
ing the basins of attrac- 
tion of the four attractive 
fixed points of the Newton 
transformation of z 4 — 1 . 
The top of the rectangle 
corresponds to the point 
at infinity, and the bottom 
of the box corresponds to 
the origin. The points —1, 
+i , + 1 , and — 1 lie on the 
equator. The shading fol- 
lows the same convention 
as in (a) and (b). Which 
point on C is represented 
by the midpoint of this 
image? 



with /. Let z G C be given and solve 

_ 3w 4 + 1 
4w 3 

to find w in terms of z. This leads to the quartic equation 

3w 4 - 4zw 3 + 1=0. 

This has four solutions, when we count solutions according to their multiplici- 
ties. We can organize these solutions to provide four functions; that is, we write 
f~ l (z) = {w i(z), W2(z), wi(z), W4(z)}. Then the Julia set is the "attractor" for the 
IFS {<C; Wi, i — 1, 2, 3, 4}. However, as in the case of quadratic transformations on 
C, this statement must be treated cautiously: for example, clearly this IFS admits 
more than one invariant set. 

Theorem 3.1 Let f : C — > C be the Newton transformation associated with the 

A A ° 

polynomial z — 1 . Let € > 0 be very small. Let X = C\ Uj =l B (at, e) where a\ = \, 
a2 = —1, a^ = i, and aA = —i. As above, define W : H(X) —> H(X) by 

W(B) = U 4 =l Wi(B) = f~\B) for all B e 7i(X). 

Then W is continuous, possesses a unique fixed point, J, the Julia set of f, and 

lim W on (B) = J for all B e H(X). 
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Figure VII.210. The 

Escape Time Algorithm is 
applied to a Newton trans- 
formation / (z) associated 
with a cubic polynomial. 
/ (z) possesses an attrac- 
tive cycle {b\, b 2 } of mini- 
mal period 2. The basin of 
attraction of the two-cycle 
is represented in white. 
Points whose orbits arrive 
within a distance of 0.01 
of the cycle, prior to 100 
iterations, are plotted in 
white. Does the basin of 
attraction of the cycle look 
familiar? 



Sketch Of proof This is essentially the same as the sketch of the proof of 
Theorem 2.1. 

The Newton transformation associated with a polynomial may possess an attrac- 
tive cycle of minimal period greater than 1. This cycle may not be directly related 
to the roots of the polynomial. As as example consider the Newton transformation 
f(z) associated with the polynomial 

F(z) = z 3 + (X-l)z+l. 

X e C can be chosen so that f(z) possesses an attractive cycle {b u b 2 } of minimal 
period 2. Figure VII.210 illustrates the basin of attraction of the cycle. The Escape 
Time Algorithm was used to obtain this image. Points whose orbits arrive within a 
distance 0.01 of the cycle, prior to 100 iterations, are plotted in black. Accordingly, 
the escape region is V = B(b x , 0.00001) U B(b 2 , 0.00001). Notice the resemblance 
of the basin of attraction of {b x , b 2 ) to the filled Julia set for z 2 - 1 . This similarity is 
not accidental. It can be explained using the theory of "polynomial-like" mappings 
[Douady 1985]. 

Some interesting computergraphical experiments involving Julia sets for New- 
ton's method are described in [Curry 1983], [Peitgen 1986], and [Vrscay 1986]. 
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Examples & Exercises 

3. 1 . Verify that z = 1 is an attractive fixed point for the Newton transformation 
associated with F(z) = z 4 — 1. 

3.2. The Newton transformation associated with the polynomial F(z) = z 2 + 1 is 

Show the corresponding IFS is {€; u>i(z) = z + Vz 2 + 1, w 2 (z) = z - Vz 2 + 1}, 
where the square root is defined appropriately. Verify that A = D& U {oo} is an attrac- 
tor of the IFS. Prove, or give evidence to show, that A is the Julia set for /(z). (Hint: 
see exercise 3.7.) How could the space t be modified so that the IFS has a unique 
attractor? Notice that numerically computed orbits of points on A, under the dynam- 
ical system {£; /}, can be constrained from escaping from A by keeping imaginary 
parts equal to zero. Verify numerically that the dynamics of {A; /} are wild. 

3.3. Find the Newton transformation /(z) associated with the polynomial F(z) = 
z 3 - 1. Use the Escape Time Algorithm to obtain an image, analagous to Fig- 
ure VII.207, which illustrates this Julia set. Discuss the dynamics of the "petals" 
in the image. 

3.4. In this example we speculate on the application of fractal geometry to bio- 
logical modelling. Let F x (z) = (z - iA)(z - l)(z + 1), where k is a real parameter, 
and let f x (z) denote the associated Newton transformation. Let J x denote the Julia 
set for fx(z). In Figure VII.21 1 we show images relating to J x , for an increasing se- 
quence of values of X. These images were computed by applying the Escape Time 
Algorithm to f x . 

These images show complex blobs that are reminiscent of something small, bi- 
ological, and organic. They make one think of the nuclei of cells; of collections of 
cells during the early stages of development of an embryo; of the process of cell di- 
vision; and of protozoans. As we track the images we see that the blobs pass through 
one another. Somehow they do so while preserving their complex geometries. Their 
geometries seem to interact with one another. Such images suggest that fractal ge- 
ometry can do more than provide a means for modelling static biological structures, 
such as ferns: it appears feasible to construct deterministic fractal models, which 
describe the processes of physiological change that occur during the growth, meta- 
morphosis, and movement of living organisms. 

3.5. Find the Newton transformation /(z) associated with the function F(z) = 
e z - 1. What are the attractive fixed points of the dynamical system {C; /}? Fig- 
ure VII.212 was computed using the Escape Time Algorithm applied to /(z), with 
W={(jc,)i)€R 2 : -2.5 < x < 2.5, -2.5 < y < 2.5}. Describe the main features 7 of 
the image. Explain, roughly, the causes of some of these features. 

3.6. What are the "petals" in the case of the Julia set for z 2 - 1? Use a picture of 
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Figure VII.211. (a)- 
(h) Julia sets associated 
with a one-parameter fam- 
ily of dynamical systems. 
Can such systems be used 
to model biological pro- 
cesses such as meiosis? 



Figure VII.211. (b) 
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Figure VII.211. (d) 
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(e) 




Figure VII.211. (f) 
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the Julia set for z 2 - 1 to illustrate the orbit of a tiny petal that is eventually periodic 
with minimal period 2. 

3.7. By making an explicit change of the coordinates, using a Mobius transforma- 
tion, show that the following two dynamical systems are equivalent: 

{C; f(z) = U z -l\] and {€; f(z) = z 2 }. 



Figure VII.212. This 
image was computed us- 
ing the Escape Time Algo- 
rithm applied to the New- 
ton transformation associ- 
ated with f(z) — e z - 1. 
The viewing window 
is W = {(*, v) e : 
-2.5 < x < 2.5, -2.5 < 
v < 2.5}. Can you work 
out what "escape region" 
was used? 



4 A Rich Source for Fractals: Invariant Sets of Continuous Open Mappings 

Let / be a transformation that acts on a space X. Recall that a set A is invariant under 
/ if f~\A) = A, and that this implies f(A) = A. We are interested in invariant sets 
of / that belong to ft(X). The following theorem is a theoretical tool that provides 
both the existence and a means for the computation of invariant sets. Most of the 
material in this chapter is based on it. 

Theorem 4. 1 Let^ (Y, d) be a metric space. Let X c Y be compact and 
nonempty. Let f : X ^ Y be continuous and such that /(X) D X. Then, (1 ) a trans- 
formation W : ft(X) H(X) is defined by 
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W(A) = f~\A) for all A e H(X). 

(2 ) W possesses a fixed point AeH(X), given by 

A = n~/ 0( -" ) (X)= lim W on (X). 

Suppose f obeys the additional condition that f(G) is an open subset of the metric 
space (/(X), d) whenever O cXis an open subset of the metric space (X, d). Then 

(3) W is a continuous transformation from the metric space (H(X), h(d)) into itself 

Proof of (1) and (2). The proof of (3) can be found in (Barnsley 
1988c).. (1) We begin by proving that W maps H(X) into H(X). Let B e H(X). 
The condition /(X) D X implies that f~\B) c X, and that f~ l (B) is nonempty. B 
is compact, so it is a closed set in the metric space (X, d). It follows that X \ B is 
open. The continuity of / implies that / _1 (X \ B) is open. Since /(X) D X d B, it 
follows that f~\B) = X\f~ l (X\ B). Hence f~\B) is closed in the metric space 
(X, d). Since X is compact it follows that f~\B) is compact. This completes the 
proof of (1). 

(2) Since /(X) D X, it follows that X D / o( - 1} (X). Application of f o( ~ n) to both 
sides of the latter equation yields 

X D / o( - 1} (X) D / o( " 2) (X) D / 0( ~ 3) (X) • • • D / 0( -" } (X) D • • • . 

It follows that {f o{ ~ n) (X)} is a Cauchy sequence in H(X), and it possesses a limit 
A e H(X), given by 

A = n«L 0 f o{ - n) (X)= lim W° n (X). 

It remains to be proved that A is a fixed point of /. We need to show that 
yo(-n (n ~ q a„) = n~ 0 A n where A n = f o{ - n \X) for n = 1, 2, . . .. First we prove 
that / 0( - 1} (n~ 0 A„) D n^° =0 A„. Suppose x € /^""(n* 0 A„). Then there is y e 
n™ =0 A n such that x inf o(_1) (y), and y e A n for n = 0, 1, 2, . . .. It follows that x e 
/ o( - 1} (A„) = A„+i for n - 0, 1, 2, . . .. It follows that x € n£l 0 A„. To prove the in- 
clusion the other way around, suppose that x e n£l 0 A„. Then x e A n+i = f~ l (A n ) 
for n = 0, 1, 2, 3, . . .. It follows that there is y n € A„ such that f(y n ) = x for n = 
0, 1, 2, 3 . . .. The sequence {y„} possesses a convergent subsequence. Let y denote 
the limit of this subsequence. Then y € A n forn = 0, 1, 2, . . ., and so y € n£L 0 A„. 
Since / is continuous it follows that f(y)=x. Hence x € / o(-1) (n£L 0 A„). We have 
shown that f o( ' l \r\% 0 A n ) C n£i 0 A„. This completes the proof of (2). 

The invariant set A referred to in Theorem 4.1 can be expressed 

A = {x e X : f on (x) e X for all n = 1, 2, 3, . . .}. 

That is, A is the set of points whose orbits do not escape from X. Also, A is the 
complement of the set of points whose orbits do escape. If X c R 2 , then pictures of 
A can be computed by using the Escape Time Algorithm. 
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The last statement in the theorem expresses a desirable property for a transforma- 
tion W on H(X). If W is not continuous, yet A 0 e H(X) and (W°"(A 0 )} converges 
to A e H(X), one cannot conclude that W(A) = A. Without continuity of W, one 
should not trust the results of applying the Escape Time Algorithm. For example, 
slight numerical errors may mean that a computed sequence of sets [A n « W on (X)} 
is not decreasing. One may still wish to define A = lim A n . Without continuity one 
cannot suppose that f~ l (A) = lim W(A n ) — A, even approximately. 

Analytic transformations map open sets to open sets. Hence their inverses act 
continuously on the space H(X), where X c C is chosen appropriately. To help 
vizualize this, look back at Figure 111.52. If the Sierpinski triangle ABO is deformed 
or moved, its inverse image POQU POQ will move continuously with it. 

The Hausdorff metric is the metric of perception: what we call a small change in 
the appearance of a picture is probably a small change in a Hausdorff distance. When 
one talks about continuous motion in the context of graphics, continuous growth in 
the context of botany, or continuous change in the context of a chemical system, 
the word "continuous" can often be replaced, pedantically, by "continuous in the 
Hausdorff metric." Theorem 4.1 suggests that one could use continuous open maps 
to model such systems. 

Examples & Exercises 

*4.1 . Let A. € [-1, 1]. Define a transformation / : IR 2 -> IR 2 by 



Show that / is continuous. Show that if X denotes a ball, centered at the origin, of 
sufficiently large radius, then /(X) D X. Also verify that if X e [-1, 0] then / maps 
open sets into open sets. Show that this is not the case for X = 1. (Hint: look at what 
the map does to a very small disk centered at the origin.) 

Figure VII.213 shows the result of applying the Escape Time Algorithm to / 
when X = 1. The inner region, bounded by an ellipse, actually represents a disk 
X such that /(X) D X. Different scales have been used in the x and y directions. 
/(X) is the region bounded by the outer curve. The image of a point that goes once 
around the inner ellipse is a point that goes twice around the origin, following the 
outer curve, which looks like a folded figure eight. Different "escape times" of orbits 
of points in X are represented by different gray tones. A magnified version of X, 
painted by escape times, is shown in Figure VII.214. Roughly speaking, regions 
closest to the outside escape fastest. Points in the white region also escape. So 
where is the invariant set A? It is right in the middle. It appears to be a branching, 
connected, tree-like set, with no interior. 

Figure VII.215 shows the result of applying the Escape Time Algorithm to / 
when X = 0. This time we see that the invariant set A, in the center, in white, is 
just the filled Julia set for z 2 — 1 . 



fix, y) 



(x 2 — y 2 — 1, 2xy) 
(x 2 - y 2 - 1 + Xx, 2xy) 



when x > 0, 
when x < 0. 
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Figure VII.213. The result of applying the Escape Time Algorithm to the function / 
in example 4.1 with X = 1. This function is continuous and is such that /(X) D X, where 
X denotes the Faberge egg in the middle. /(X) is the region bounded by the outer curve. 
Different "escape times" of orbits of points in X are represented by different gray tones. 



What happens if we choose I = -1? This time we obtain the image shown in 
Figure VII.216. However, this time things may not be as simple as they appear to be. 
The inner "layers" that surround the apparent invariant set A are highly irregular and 
unstable. That is, points that are very close together seem to have orbits that have 
very different escape times. Could it be that, although the mapping W : H(X) -> 
H(X) is continuous, it has a very poor modulus of continuity? 

4.2. Construct a function / : X -* R 2 , where X C K 2 , which obeys the conditions 
of Theorem 4.1. Use the Escape Time Algorithm to analyze the associated invariant 
set A described in the statement of the theorem. Your example should be interesting 
and of a different character from those specifically described in this chapter. 
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Figure VII.214. A magnified version of the region X in Figure VII.213. Approximately 
orbits of points closest to the outside escape the fastest. Points in the white region also escape 
The invariant set A is in the middle. It appears to be a branching, connected, tree-like set It is 
surrounded by layers, just as the center of a real tree is surrounded by layers of growth 
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Figure VII.215. The result of applying the Escape Time Algorithm to / in exam- 
ple VII.213 with A = 0. This time we see that the invariant set A, in the center, in white, 
is just the filled Julia set for z 2 — 1 . 
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Figure VII.216. This image was computed using the Escape Time Algorithm applied 
to the Newton transformation associated with f(z) = e z - 1. The viewing window is 
W = [(x, y) € R 2 : -2.5 < x < 2.5, -2.5 < y < 2.5). Can you work out what "escape 
region" was used? ^ 



Chapter VIII 

Parameter Spaces and 
Mandelbrot Sets 

1 The Idea of a Parameter Space: A Map of Fractals 

A map with nothing marked on it is practically useless. A map of a 1000 x 1000 
square mile region containing the British Isles is shown in Figure VIII.217; it does 
not convey much information. However, as a concept it is quite exciting. Each lo- 
cation on the map corresponds to somewhere on Earth. For example, the dot with 
coordinates (750, 227.3) represents the town of Maidstone. A point on the map may 
represent a certain grain of soil in a ploughed field, or a molecule of flotsam on the 
top of some foam on the surface of the sea. Nearby points in the map correspond 
to nearby points on the Earth. Connected sets with interiors correspond to physical 
regions. 

How could a perfect map be made? Ideally, it should specify locations on the 
Earth's surface at a certain instant. The coordinates would be relative to some abso- 
lute coordinate system, perhaps determined by reference to the fixed stars. Moreover, 
the surface of the Earth would have to be denned precisely, up to the last molecule 
of water, soil, and plant matter; for this purpose one can imagine using a straight line 
from the center of the Earth as suggested in Figure VIII.218. Of course, maps are not 
made like this, but the goal is the same: to have an accurate correspondence between 
points on the physical surface of the Earth and the physical surface of the paper. 

We must be careful how we interpret a map. Geographical maps are complicated 
by the real number system and the unphysical notion of infinite divisibility. Mathe- 
matically, the map is an abstract place. A point on the map cannot represent a certain 
physical atom in the real world, not just because of inaccuracies in the map, but be- 
cause of the dual nature of matter: according to current theories one cannot know the 
exact location of an atom at a given instant. 

Fractal geometers avoid this problem by pretending that the surface of the Earth is 
an abstract place too; we imagine, once again, that matter is infinitely divisible, and 
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1000 miles 




Figure VIII.217. A 

map with no information 
marked on it is quite excit- 
ing. A very unhelpful map 
of the British Isles. The 
dot represents Maidstone. 
The map is accurate up to 
dot size. 



1000 miles 



that we can address every point. In the same spirit, we presume that we can model 
trees, clouds, horizons, churning seas, and infinitely finely defined coastlines. Then, 
for example, we can define the Hausdorff-Besicovitch dimension of the coastline of 
the British Isles. 

For a map to be useful it must have information marked on it, such as heights 
above sea level, population densities, roads, vegetation, rainfall, types of underlying 
rock, ownership, names, incidence of volcanoes, malarial infestation, and so on. 
A good way of providing such information is with colors. For example, if we use 
blue for water and green for land then we can "see" the land on the map and we 
can understand some geometrical relationships. We can estimate overland distances 
between points, land areas of islands, the shortest sea passage from LLanellian Bay 




exact center of Earth 
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Figure VIII.219. In 

this map points corre- 
sponding to land have 
been shaded. A fascinat- 
ing entity, the coastline, is 
revealed. 



1000 miles 




-1000 miles 



to Amylwch Harbour, the length of the coastline, etc. All this is achieved through 
the device of marking some colors on a blank map! 

Let us consider the boundary of the shaded region in Figure VIII.219. It is here 
that the map conveys extra information. In the interior of the shaded region we learn 
no more about the surface of the Earth than that "there is land there." However, on 
the boundary we learn not only that "there are land and sea there," but also, if the 
map is accurate enough, a feature that we will actually "see" on the surface, namely 
the local shape of the coastline. 

The latter idea can be extended. If we include more colors on a geographical map, 
to provide more information about properties of the Earth's surface, we produce 
more boundaries on the map. These boundaries can give information about local 
geometry. For example, a map finely colored according to elevation reveals the 
shapes of the bases of the mountains, the paths of rivers, and - if we look closely 
enough— the outlines of buildings. Such a map, made abstract and perfect, placed in 
a metric space, would contain much detailed information about what, at each point, 
the local observer would see. 



Examples & Exercises 

1.1. Study an atlas that contains maps colored according to diverse criteria, such as 
rainfall, population density, vegetation, and elevation. Discuss to what extent these 
maps provide information about the local geometry of the surface of the Earth. 

We turn attention to making colored maps of parameterized families of fractals. 
We consider families of iterated function systems and families of dynamical systems 
that depend on two real parameters. The collection of possible parameter values 
defines a parameter space associated with the family. We use the notation P to 
denote a parameter space. Typically P is a subspace of (R 2 , Euclidean) such as ■, 
a closed ball, or R 2 . 

An example of a parameter space is P = {(*i, X 2 ) € R 2 : |*il, |A. 2 | < 2 ■ }. This 
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Figure VIII.220. An 

example of a parameter 
space. Each point A. in 
the space corresponds to 
a fractal, A(X). This is a 
poor map, because very 
little has been marked on 
it. It is like the map of 
the British Isles shown in 
Figure VIII.217. It needs 
coloring. 



is a parameter space for the family of hyperbolic IFS {€; (Ai + /A 2 )z + 1, (Ai + 
/A 2 )z — 1}. Each point A = (Ai, A 2 ) £ P corresponds to an IFS. Each IFS possesses a 
unique attractor, say A (A). Hence each point of P corresponds to a single fractal. We 
can think of P as representing part of H(€), a space of fractals. A map of P, with a 
few points marked on it, is shown in Figure VIII.220. Each point in P corresponds 
to a single fractal. Nearby points in P correspond to nearby fractals, that is, points 
in H(C) whose Hausdorff distance apart is small. 

Another example is P =C, which provides a parameter space for the family of 
dynamical systems {C; f x (z) = z 2 - A}. Each point in the parameter space corre- 
sponds to a different dynamical system. Each dynamical system is associated with a 
unique Julia set, 7(A). The collection of fractals {7(A) : A e P}, associated with the 
parameter space, is vast and diverse. 

Let X be a two-dimensional metric space such as R 2 or C Let P denote a pa- 
rameter space corresponding to a family of fractals {A(k) e H(X) : A e P}. Can we 
provide the explorer of fractals, who wishes to investigate this collection of fractals, 
with a colored map? This map should give her information about the sets A (A) to be 
found at different points on P. 

Suppose P = ■. To make a map, let us represent P by the set of pixels on the 
screen of a computergraphics monitor. The idea is to color the pixel A according 
to some property of A (A). (We write A = pixel = point in P, without repeatedly 
explaining that A is a point in the small rectangle in P that corresponds to the pixel.) 
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Suitable properties could relate to the connectivity of A (A.), the fractal dimension of 
A (A.), the escape time near a special point on A (A) under an associated dynamical 
system, the number of holes in A(A), or the presence of straight lines in A (A.), for 
example. 

If we make a good selection of the properties to associate with colors, the result 
will be a useful map containing various differently colored regions. This map will be 
a ready reference for the explorer of fractals. It will tell him something about what 
to expect at he travels about P. He might be suprised nonetheless. 

The boundaries of the colored regions can provide additional geometrical infor- 
mation to the explorer, over and above the information that the map was originally 
designed to convey. It sometimes occurs that the local shapes of the boundaries in 
the map reflect the shapes of the corresponding fractals. There is a deep principle 
here, which we shall not pin down as a theorem, but which we will illustrate in a 
number of cases. 

Examples & Exercises 

1 .2. It is often useful, when establishing the Lipshitz continuity condition in Theo- 
rem 1 1.1 in Chapter III, to remove dependencies on x e X by restricting the domain 
of the IFS to a compact set B that contains the attractors A (A) for all values of A e P. 
Since this does not change the attractors themselves, continuous dependency on A in 
B then guarantees it for the original IFS. This can be done in practice by means of 
the following lemma: 

Lemma 1.1 Let B e H(X) such that for allpeP, W(p, B) C B. Then for all 
peP, A(p) C B. 
Prove the lemma. 

1.3. Let P = {(Ai, A 2 ) e 1R 2 : |A.i|, |A 2 | < 0.9}. The family of IFS {R; A^, k 2 x + 
1 - A 2 , 0.5jc + 0.5} is hyperbolic, with contractivity factor s = 0.9, for all A e P. 
Use Theorem 3. 1 1 . 1 and the lemma of the previous problem to prove that the attrac- 
tor A (A) depends continuously on A. 

1 .4. The family of IFS {[0, 1]; Aut 2 , A 2 x + (1 - A 2 )} is hyperbolic, with contractiv- 
ity factor s = 0.9, for all A in the parameter space 

p = {(Ai, A 2 ) e 1R 2 : 0 < Ai < 0.45, 0 < A 2 < 0.9}. 

Since we have 

d(w lp (x), w lq (x)) = \x 2 \\k lp - k lq \ < |A lp - Xi t | 
d(w 2p (x), w lq (x)) = \x- l\\k 2p - A 2 J < |A 2p - A 2 J, 

we can satisfy the Lipshitz condition with k = 1 in both cases. The attractor depends 

continuously on A. 

1 .5. Sometimes the Lipshitz condition is not that convenient to prove directly. In 
this case, we can go directly to the continuity statement that it is used to guarantee. 
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Suppose we can show directly that w, (p, x) is a continuous function from PxX^ 
X. That is, without holding either p or x fixed, given e > 0, there is a 8, > 0 for each 
Wi(p, x), such that 



Then the conclusions of Theorem 11.1 in Chapter III hold. This does not mean 
that the Lipshitz condition (or something like it) is unnecessary; such conditions 
guarantee the statement we have just made. If we can guarantee it some other way, it 
is equivalent. 

1.6. An example of a parameter space is (P, Euclidean), where P = {(X\, k 2 ) e R 2 : 
I- 1 h I < 0.999}. This space can be used to represent the family of hyperbolic IFS 
{K 2 ; w\, w 2 ), where 



Since (using (z, w) = (X\, k 2 ) to emphasize familiarity) 

w\(x, y, z, w) = (zx, wy + 1) 

is continuous from basic calculus, Theorem 1 1.1 in Chapter III holds, and the attrac- 
tor varies continuously with A.. 



2 Mandelbrot Sets for Pairs of Transformations 

Let P c 0& 2 be a parameter space corresponding to a family of fractals. That is, 
we have a function A: P -> H(X), so that each point A e P corresponds to a set 
A (X) e H(X). One way to make a map is to color the parameter space according to 
whether or not A (A) is connected. 

Theorem 2. 1 Let {X; w\ , W2} be a hyperbolic IFS with attractor A. Let w\ and 
wi be one-to-one on A. If 



d(wi(p,x), Wi(q, y) <e 



whenever d((p, x), (q, x)) < 6,-. 




tui(A) fl w 2 (A) = &, 



then A is totally disconnected. If 



w l (A)nw 2 (A)^&, 



then A is connected. 



Proof Suppose that 1^1 (A) n w 2 (A) = 0. Let E denote the code space map asso- 
ciated with the IFS. By theorem 2.2 in Chapter IV the code space map <t> : £ -> A is 
invertible. 0 is also a continuous transformation between two compact metric spaces. 



300 Chapter VIII Parameter Spaces and Mandelbrot Sets 

Hence, by Theorem 8.5 in Chapter II, <p is a homeomorphism. Hence A is homeo- 
morphic to code space, which is totally disconnected. (Recall that code space on two 
or more symbols is metrically equivalent to a classical Cantor set.) It follows that A 
is totally disconnected. 

Suppose that wi(A) D 102(A) ^ 0. Then there is at least one point x e u>i(A) fl 
102(A). This point x has two addresses, say 

x = 0(£) = (j>(a), where i;\ — 1 and o x = 2. 

Let us see what happens if we additionally suppose that "A is not connected." Then, 
since A is compact, we can find two nonempty compact sets E and F so that 

A = EUF, E D F = 0. 

Using compactness, there is a positive real number 8 so that 

d(e,f)>8 for all e e E, f e F. 

Let 7r and f be a pair of codes that agree through the first K symbols, for some 
positive integer K. That is 7r, = x/s, for i — 1, 2, . . . , K. Then 

d(<j)(7t), 0(VO) < s*diam(A), 

where diam(A) = Max{d(x, y) : x, y e A}, and s e [0, 1) is a contractivity factor for 
the IFS. Suppose also that (j>(n) e E and (j)(x(r) e F. Then 

8< d(4>(TT),<t>W)). 

Combining the latter two inequalities we discover 8 < s*diam(A), which implies 

log(<Vdiam(A)) 

A. < . 

log (s) 

We conclude that if e e E and / e F then the number of successive agreements 
between an address of e and an address of / cannot exceed the number on the right- 
handside here. It follows that there is a maximum number, M, of initial agreements 
between the address of a point e e E and a point / e F; and this maximum is 
achieved on some pair of points, say e and /. Then we can find p t e {1, 2} for 
i = 1, 2, . . . , M such that 

P2, P3, Pm, 1, • • •) = e e E 

and 

<P(Pu Pi, P3, Pm, 2, ...) = / e F. 
Now consider the point z e A, which has the two addresses 

Z=(j>(Pl, Pi, P3, ...,Pm,1, Kl, £3, £4, • • •) 

= <P(Pi, Pi, P3, Pm, 2, o 2 , 03, oa, • • •)• 
Suppose zg£. Then its address agrees with that of / e F through (M +1) initial 
symbols. Hence z € F. But then its address agrees with that of e e E through (M + 
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Figure VIII.221. A 

map of the family of IFS 
{C; kz - 1, A.z + 1}, where 
the parameter space is 
P = {k = (A,, A 2 ) € C : 
| A. | < 1}. This picture of 
a parameter space is ob- 
tained by "painting" black 
where the attractor of 
the IFS is disconnected 
and light where it is con- 
nected. The Mandelbrot 
set is the light region, the 
sea. It contains a dragon at 
A = (0.5, 0.5). 




1) initial symbols, which is not possible. We have a contradiction. Hence "A is 
not disconnected." It follows that A is connected. This completes the proof of the 
theorem. 

Definition 2.1 Let {X; w\, w 2 } be a family of hyperbolic IFS that depends on a 
parameter k e P C M. 2 . Let A (A.) denote the attractor of the IFS. The set of points 
M C P defined by 

M = [k e P : A (A.) is connected} 
is called the Mandelbrot set for the family of IFS. 

For the rest of this section we consider the family of IFS 

{<C; kz - hkz + 1}, 

where the parameter space is 

P = {k = (ki, k 2 ) e C : k\ + k\ < 1}. 

Figure VIII.221 shows a picture of the associated Mandelbrot set, M. This is a map 
for the collection of fractals associated with the IFS. It has been colored dark where 
the attractor is totally disconnected and light where it is connected. 

Here is an outline of an algorithm to compute images of the Mandelbot set M 
associated with the family {€; w\(z) = kz-l, w 2 (z) = kz + 1}. It is based on The- 
orem 2. 1 . 
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Algorithm 2.1 Example of Method for Making Pictures of the 
Mandelbrot Set of a Family of IFS. 

(1) Choose a positive integer, L, corresponding to the amount of computation one 
is able to do. The greater the value ofL, the more accurate the resulting map 
image will be. 

(2) Represent the parameter space P = {X e C : |X| < 1} by an array of pixels. 
Carry out the following steps for each X in the array. 

(3) Calculate a number R, so that the attractor is contained in a ball of radius R, 
centered at the origin; that is, choose R > 0 so that A(X) C B(0, R). 

(4 ) Compute the number 

H = min{d(x, y) : x G iui(W oL ({0})), y e w 2 (W oL ({0}))}, 
where W = wiUw 2 .IfH < 2\X\ L+l R, then the pixel X is assumed to belong 
to M and is colored accordingly. 

Step (4) is based on the following observation. The attractor of the IFS is 
contained in the set W o(L+l) (B(0, R)), which consists of 2 L+1 balls of radius 
\X\ L+l R. The centers of these balls lie in the union of the two sets wi(W oL (0)) and 
w 2 (W oL (0)). If H is greater than 2\X\ L+l R, then A(X) must be disconnected. 

Figure VIII.222 shows the "coastal region" of a quarter of the complement of the 
Mandelbrot set in Figure VIII.221. It has been laid over a grid in order to help you 
locate points where interesting fractals lie. 

The boundary of M is complicated and intricate. Close-ups of the "coast- 
line" near the places marked (a), (b), and (c) are shown in Figure VIII.223. Fig- 
ure VIII.224 shows a zoom on the spiral peninsula in Figure VIII.222. 

Now look at Figures VIII.225 and VIII.226, which show pictures of the attractors 
A(X) for some points X located near the boundary of the Mandelbrot set. There is 
a "family resemblance" between the places on the boundary from which the fractals 
come, and the fractals themselves. To help see this you should look back at the close- 
ups on the coastline in Figure VIII.223. Figure VIII.227 shows the IFS attractor 
corresponding to the tip of the peninsula in Figure 224. Notice how it contains 
spirals, very much like the ones in the peninsula in parameter space. At the end of 
this chapter we make some comments on why such "family resemblances" occur. 

The following theorem provides rigorous bounds on the locations of M and dM. 
The proof is delightful, because it relies on a fractal dimension estimate. 

Theorem 2.2 [Barnsley 1985c] The attractor A(X) of the IFS {€; Xz-l,Xz + 
1} is totally disconnected if\X\ < 0.5 and connected if I > |X| > 1/V2. The bound- 
ary of the associated Mandelbrot set is contained in the annulus 1/2 < |X| < 1/V2. 

Proof Let A denote the attractor of the IFS and let D(A) denote its fractal di- 
mension. The two maps in the IFS are similitudes of scaling factor This means 
that Theorem 2.3 in Chapter V can be applied. 
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Figure VIII.222. This 
shows the coastal region 
of a quarter of the com- 
plement of the Mandelbrot 
set in Figure VIII.221. 
It has been laid over a 
grid in order to help you 
locate points where inter- 
esting fractals lie. Close- 
ups of the coast at (a), 
(b), and (c) are shown in 
Figure VIII.223. The co- 
ordinates of the grid are 
(0, 0) - -(0.71,0.71). 



Suppose that A is totally disconnected. Then the IFS is totally disconnected and, 
by Theorem 2.3 in Chapter V, 

D(A) = log(l/2)/ log(|A.|). 

By Theorem 2.1 in Chapter V, D(A) < 2. Hence 

log(l/2)/log(|A.|)<2. 

This implies that |A.| < 1/V2. 

Suppose that A is connected. Then it contains a path that connects two dis- 
tinct points. The fractal dimension of any path is greater than or equal to 1. Hence 
D(A) > 1. However, by Theorem 2.3 in Chapter V, 

D(A) < log(l/2)/log(|X|). 

It follows that 

l<log(l/2)/log(|A.|). 
This implies that |A.| > 1/2. This completes the proof of the theorem. 

A different point of view on the Mandelbrot set considered above is given by 
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Figure VIII.223. Close- 
ups of the boundary of the 
Mandelbrot set at (a), 
(b), and (c). The diverse 
structures in this bound- 
ary echo the shapes of 




the attractors of the corre- 
sponding IFS. 




Figure VIII. 228. M has been turned inside-out by making the change of variables 
X' = A." 1 . The inner white disk is no-man's land; it does not belong to the parameter 
space. Also included are the two bounds provided by Theorem 2.2, namely the circle 
| A/ 1 = 2 and the circle \X'\ = V2. The fractal dimension decreases with increasing 
distance from the origin. 



Examples & Exercises 

2. 1 . Sketch the Mandelbrot set for the family of IFS {[R; X t x + A. 2 , X 2 x + X\} where 
the parameter space is P = {(X\, X 2 ) : |Xi|, | A.^ I < 1}- 

2.2. Let {X; w 2 ) be a family of hyperbolic IFS which depends on a parameter 
X e P C R 2 . Let w\ and w 2 depend Lipshitz continuously on X for some k > 0 and 
for fixed x eX. Assume that the IFS has contractivity factor 5 e [0, 1) which is 
independent of X e P. Then by Theorem 11.1 in Chapter III, the function A : P -+ 
H(X) is continuous. Use this continuity to prove that the Mandelbrot set associated 
with the family of IFS is closed. It is suggested that you begin by showing that the 
set S = {B 6 H(X) : B is not connected} is an open subset of H{X). 
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Figure VIII.224. Close- 
up on the spiral penin- 
sula on the edge of 
the Mandelbrot set in 
Figures VIII.221 and 
VIII.222. What infor- 
mation about the corre- 
sponding fractals does this 
boundary convey? 



2.3. Use Figure VIII.222 to determine some values of X that belong, approximately, 
to the boundary dM of the Mandelbrot set. Compute images of the correspond- 
ing attractors. Compare images corresponding to two points A.i and X 2 e dM, with 
I A- 1 1 < |A.2|. Explain why the picture of A(X { ) is more delicate than the picture of 
A(A.2). Also comment on similarities and differences between your images and the 
local geography of the^arts of dM from which they come. 



2.4. The pictures of the Mandelbrot set associated with the family of IFS {<C; Xz - 



306 Chapter VIII Parameter Spaces and Mandelbrot Sets 



Figure VIII.225. Some 
of the fractals to be found 
at various points near the 
boundary of the Mandel- 
brot set associated with 
the parameterized family 
oflFS {£; kz- 1, Xz + 1}. 




1, Xz + 1} suggest the conjecture that M is symmetric about the x-axis and about 
the origin. Prove the conjecture. 

2.5. An interesting point in the parameter space for the family {C; kz - 1, kz + 1} 
is k = (1/2, 1/2). This lies on the circle 1/|X| = \k'\ = Vl in Figure VIII.228. It 
appears to be located in the interior of the Mandelbrot set, although the IFS is just- 
touching. It corresponds to the Twin-Dragon Fractal. A picture of it is shown in 
Figure VIII.229. It is possible to tile the plane with Twin-Dragons. Various other 
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Figure VIII. 226. Some 
of the IFS attractors to be 
found at various points 
near the boundary of the 
Mandelbrot set associated 
with the parameterized 
family of IFS {€; kz - 
1, kz + 1}. Where would 
you look for an interesting 
fractal? 



values of k also correspond to tilings of the plane. See [Gilbert 1982]. Show that the 
attractor at the point A = (0, 1 /a/2) can be used to tile the plane. 

2.6. Notice the line segments on the real axis in Figure VIII.228. In [Barnsley 
1985c] it is proved thai? 



{A e C : 0.5 < X x < 0.53; k 2 = 0} c M. 
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Figure VIII.227. At 

tractor of the IFS {C; kz - 
1, kz + 1} corresponding 
to the value of k at the 
tip of the spiral peninsula, 
shown in Figure VIII. 224. 




For k in such a line segment what does the attractor look like? Are you suprised, in 
view of what you know about maps of coastlines? 

2.7. Some of the most delicate attractors of the family {C; kz - 1, + 1} are 
associated with points on BM where it touches the circle 1 / 1 A. | = \k'\ = 2. These 
have the lowest possible fractal dimension while still being connected. Let us call 
these attractors tree-like if w x (A) n w 2 (A) is a single point. Argue (or, better yet, 
prove) that a tree-like attractor A contains no trapped holes; that is, A contains 
no nontrivial, non-self-intersecting paths that start and finish at the same point. A 
picture of a tree-like attractor is shown in Figure VIII.230. 

2.8. Let e — e\eje-s ... e n .... be a point in the code space E of two symbols, with 
e n € {+1, -1} for all n. Let k e C. Prove that the series 

f(k) = ei+ e 2 k + e 3 k 2 + e 4 ^ + e 5 k 4 + e 6 k 5 ... + e n k n + ... 

has radius of convergence 1. What is the relationship between f(k) and the code 
space map 0 : E — ► A(k) associated with the family of IFS {C; kz - l,kz + 1}? Let 
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DECREASING 
FRACTAL 
DIMENSION 



DECREASING 
FRACTAL 
DIMENSION 




TOTALLY 
DISCONNECTED 
HERE AND IN 
BLACK 



|A| < 1. Show that the attractor of the IFS is the union of all points which can be 
written in the form 



Figure VI 1 1. 228. Inside- 
out picture of the 
Mandelbrot set for 

{QXz- l,Xz + l}. It 
has been turned inside-out 
by making the change of 
variables k' = A -1 . The in- 
ner white disk is no-man's 
land; it does not belong 
to the parameter space. 
The figure also includes 
the two bounds provided 
by Theorem 2.2, namely 
the circle |A'| = 2 and the 
circle |A'| = sfl. 



±1 ± A ± A 2 ± A 3 ± A 4 ± A 5 ± A 6 ± A 7 ± A 8 ± A 9 ± A 10 ± A 11 ± 



3 The Mandelbrot Set for Julia Sets 

In this section we introduce a good method for making maps, such as might be found 
in an atlas, of families of dynamical systems. The method is based on the use of 
escape times and is discussed more generally in section 4. Here we restict attention 
-to the family 

{C; A(z) = z 2 -A}, 

where the parameter space is P = C. This family is of special importance because 
it provides a model for the onset of chaotic behavior in physical and biological 
systems; see [May 1976] and [Feigenbaum 1979]. Moreover, it was the first family 
of dynamical systems for which a useful computergraphical map was constructed, 
by Mandelbrot. We concentrate on map making. 
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Figure VIII.229. The 

Twin-Dragon Fractal. You 
can tile the plane with 
these sets. Although it is 
just-touching, it appears 
to lie in the interior of the 
Mandelbrot set. 



The Julia set J(X) associated with f x (z) is symmetric about the origin, O. We 
know this because the filled Julia set, of which 7(A) is the boundary, is the set of 
points whose orbits remain bounded. The orbit of z e C remains bounded if and only 
if the orbit of — z remains bounded. 

For some values of A e P, O belongs to the filled Julia set, F(X), while for others 
it is quite far from F(X). This suggests that we try to color the parameter space 
according to the distance from O to F(X). How can we estimate this distance? An 
approximate method is to look at the "escape time" of the orbit of O. That is, we 
can color the parameter space according to the number of steps along the orbit of 
O that are required before it lands in a ball around the point at infinity, from where 
we know that all orbits diverge. The intuitive idea is that the longer an orbit of 0 
takes to reach the ball, the closer O must be to F(X). Of course, if an orbit does not 
diverge then we know that O e F(X). 

Suppose that we want to make a map corresponding to a region W C P. Here we 

choose 

W = {X = (X 1 ,X 2 )eC:|X 1 |,|X 2 |<2}. 

Let R > 0 and define 

V(R) = {ze€: \z\ > R) U {oo}. 

Suppose 

R > 0.5 +0.25 + 

Then it is readily proved that the orbit {fZ"(z)} diverges if and only if it intersects 
V(fl). So if we choose R = 10 we are sure that, for all X e W, the orbit {/£"(())} 
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Figure VIII.230. A 

"tree-like" attractor 
A from the family 
{C; iui(z) = kz- l,w 2 = 
Xz + 1}. The two sets 
iui(A) and w 2 (A) meet 
approximately at a single 
point. 



diverges if and only if it intersects V(R). Let us see what happens if we color the 
pixels of W according to the number of iterations required to enter V(10). 

The following program is written in BASIC. It runs without modification on an 
IBM PC with Enhanced Graphics Adaptor and Turbobasic. On any line the words 
preceded by ' are comments: they are not part of the program. 

Program 1 . (Algorithm for Coloring Parameter Space According to an Escape Time) 
'Define viewing window, W, and numits. 

numits = 20 : a = -2 : b = -2 : c = 2 : d = 2 : M = 100 

R =10 'Define the region V. 

screen 9 : els 'Initialize graphics. 

for p =1 to M 

for q =1 to M 

'Specify the value of lambda (k,l) in P 
k = a+(c-a)*p/M : 1 = b+(d-b)*q/M 
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'Specify the initial point, 0, on the orbit 
x =0 : y =0 

'Compute at most numits points on the orbit of 0 
for n = 1 to numits 

newx = x*x-y*y-k 

newy = 2*x*y-l 

x = newx : y = newy 

'If the most recently computed point lies in V then... 
if x*x+y*y > R then 

'... render the pixel (p,q) in color n, and 
'go to the next (p,q). 
pset(p,q),n : n = numits 

end if 

if instat then end 'Stop computing if any key is pressed! 

next n : next q : next p 

end 

Color Plate 16 shows the result of running a version of Program 1 on a Masscomp 
5600 workstation with Aurora graphics. 

In Figure VIII.231 we show the result of running a version of Program 1, but this 
time in halftones. The central white object corresponds to values of X for which the 
computed orbit of O does not reach V during the first numits iterations. It represents 
the Mandelbrot set (defined below) for the dynamical system {£; z 2 — X}. The bands 
of colors (or white and shades of gray) surrounding the Mandelbrot set correspond 
to different numbers of iterations required before the orbit of O reaches V(10). 
The bands farthest away from the center represent orbits that reach O most rapidly. 
Approximately, the distance from O to F(X) increases with the distance from X to 
the Mandelbrot set. 

Definition 3. 1 The Mandelbrot set for the family of dynamical systems {£; z 2 - 
X} is 

M. = {X e P : J(X) is connected}. 



The relationship between escape times of orbits of O and the connectivity of J(X) 
is provided by the following theorem. 
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Figure VIII.231. The 

Mandelbrot set for z 2 — K 
computed by escape times. 



Theorem 3.1 The Julia set for the family of dynamical systems {C; fx(z) = 
z 2 - A}, X e P = C, is connected if and only if the orbit of the origin does not escape 
to infinity; that is 

M = {Xe£: |/ x °"(0)| Aooasn^oo} 



Proof This theorem follows from [Brolin], Theorem 11.2, which says that the 
Julia set of a polynomial, of degree greater than 1, is connected if and only if none 
of the finite critical points lie in the basin of attraction of the point at infinity. f x (z) 
possesses two critical points, 0 and oo. Hence J(X) is connected if and only if 
A oo as n ^ oo. 

In this paragraph we discuss the relationship between the Mandelbrot set for 
the family of dynamical systems {€; z 2 - M and the corresponding family of IFS 
{C; VFTX, -slz + >.}• We know that for various values of X in C the IFS can 
be modified so that it is hyperbolic, with attractor J(X). For the purposes of this 
paragraph let us pretend that the IFS is hyperbolic, with attractor J(k),for allXeC. 
Then Definition 2.1 would be equivalent to Definition 3.1. By Theorem 2.1, the 
attractor of the IFS would be connected if and only if wi(J(X)) n w 2 (J(X)) 0- But 
wi (Q n w 2 (€) = 0. Then it would follow that the attractor of the IFS is connected if 
and only if 0 e J(X). In other words: we discover the same criteria for connectivity 
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of J(k) if we argue informally using the IFS point of view, as can be proved using 
Julia set theory. This completes the discussion. 

We return to the theme of coastlines and the possible resemblance between fractal 
sets corresponding to points on boundaries in parameter space and the local geome- 
try of the boundaries. Figures VIII.232 and VIII.233 show the Mandelbrot set for 
z 2 - k, together with pictures of filled Julia sets corresponding to various points 
around the boundary. If one makes a very high-resolution image of the boundary 
of the Mandelbrot set, at a value of k corresponding to one of these Julia sets, one 
"usually" finds structures that resemble the Julia set. It is as though the boundary of 
the Mandelbrot set is made by stitching together microscopic copies of the Julia sets 
that it represents. An example of such a magnification of a piece of the boundary of 
M, and a picture of a corresponding Julia set, are shown in Figures VIII.234 and 
VIII.235. 

If you look closely at the pictures of the Mandelbrot set M considered in this 
section, you will see that there appear to be some parts of the set that are not 
connected to the main body. Pictures can be misleading. 

Theorem 3.2 [Mandelbrot-Douady-Hubbard] 

The Mandelbrot set for the family of dynamical systems {€; z 2 — k} is connected. 
Proof This can be found in [Douady 1982]. 

The Mandelbrot set for z 2 - k is related to the exciting subject of cascades of bi- 
furcations, quantitative universality, chaos, and the work of Feigenbaum. To learn 
more you could consult [Feigenbaum 1979], [Douady 1982], [Barnsley 1984], [De- 
vaney 1986], [Peitgen 1986], and [Scia 1987]. 

Examples & Exercises 

3. 1 . Rewrite Program 1 in a form suitable for your own computergraphical envi- 
ronment. Run your program and obtain hardcopy of the output. Adjust the window 
parameters a, b, c, and d, to allow you to make zooms on the boundary of the Man- 
delbrot set. 

3.2. Figure VIII.236 shows a picture of the Mandelbrot set for the family of dynam- 
ical systems {€; z 2 - k} corresponding to the coordinates -0.5 < k\ < 1.5, -1.0 < 
k 2 < 1 .0. It has been overlaid on a coordinate grid. The middle of the first bubble has 
not been plotted, to clarify the coordinate grid. Let Bo, B\, B 2 , B 3 , . . . denote the se- 
quence of bubbles on the real axis, reading from left to right. Verify computationally 
that when k lies in the interior of B n the dynamical system possesses an attractive 
cycle, located in C, of minimal period 2 n , for n = 0, 1, 2, and 3. 

3.3. The sequence of bubbles {B n }^ =0 in exercise 3.2 converges to the Myreberg 

point, k = 1.40115 The ratios of the widths of successive bubbles converge to 

the Feigenbaum ratio 4.66920 .... Make a conjecture about what sort of "attractive 
cycle" the dynamical system {C; z 2 - k} might possess at the Myreberg point. Test 
your conjecture numerically. You will find it easiest to restrict attention to real orbits. 
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Figure VIII.232. Man- 
delbrot set for z 2 -K dec- 
orated with various Julia 
sets and filled Julia sets. 




Figure VIIL233. Man- 
delbrot set for z 2 — k, 
decorated with various 
Julia sets and filled Julia 
sets. These often resemble 
the place on the boundary 
from which they come, es- 
pecially if one magnifies 
up enough. 
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Figure VIII.235. A 

filled Julia set corre- 
sponding to the piece 
of the coastline of the 
Mandelbrot set in Fig- 
ure VHI.234. Notice the 
family resemblances. 
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Figure VIII.236. A 

picture of the Mandel- 
brot set for the family 
of dynamical systems 
{<C; v — A}. It has been 
overlaid on a coordinate 
grid. The middle of the 
first bubble has not been 
plotted, to clarify the coor- 
dinate grid. 



3.4. Make a parameter space map for the family of dynamical systems {C; fx(z)}, 
where f x is the Newton transformation associated with the family of polynomials 

F( Z ) = z 3 + (X - \)z + 1, XeP = C. 

Notice that the polynomial has a root located at z = 1, independent of X. Color 
your map according to the "escape time" of the orbit of O to a ball of small radius 
centered at z = 1. Use black to represent values of X for which O does not converge 
to z = 1. Examine some Julia sets of fx corresponding to points on the boundary of 
the black region. Are there resemblances between structures that occur in your map 
of parameter space, and some of the corresponding collection of Julia sets? (The 
correct answer to this question can be found in [Curry 1983].) 



4 How to Make Maps of Families of Fractals Using Escape Times 

We begin by looking at jhe Mandelbrot set for a certain family of IFS. It is disap- 
pointing, and we do not learn much. We then introduce a related family of dynam- 
ical systems and color the parameter space using escape times. The result is a map 
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Figure VIII.237. The 

complement of the Man- 
delbrot set M i associated 
with the family of IFS 
{C; wi = kz + 1, w 2 = 
k*z — 1}. Points in the 
complement of the Man- 
delbrot set are colored 
black. The boundary of 
Mi is smooth and does 
not reveal much infor- 
mation about the family 
of fractals it represents. 
The figure also shows 
attractors of the IFS corre- 
sponding to various points 
on the boundary of M i . 
What a disappointing map 
this is! 




packed with information. We generalize the procedure to provide a method for mak- 
ing maps of other families of dynamical systems. We discover how certain bound- 
aries in the resulting maps can yield information about the appearance of the fractals 
in the family. That is, we begin to learn to read the maps. 

Figures VIII.237 and VIII.238 show the Mandelbrot set Mi for the family of 
hyperbolic IFS 

{C; wi(z) = Xz + 1, w 2 (z) = r Z -l},P = {UC: |X| < 1}. 

We use the notation X* = (Xi + i'X 2 )* = (Xi - /X 2 ) for the complex conjugate of X. 
The two transformations are similitudes of scaling factor |X|. At fixed X, they rotate 
in opposite directions through the same angle. The figures also show attractors of 
the IFS corresponding to various points around the boundary of the Mandelbrot set. 
What a disappointing map this is! There are no secret bays, jutting peninsulas, nor 
ragged rocks in the coastline. 

Theorem 4.1 (Hardin 1985). The Mandelbrot set Mi is connected. Its 
boundary is the union of a countable set of smooth curves and is piecewise dif- 
ferentiable. 

Proof This can be found in [Barnsley 1988d]. 
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Figure VIII.238. The 

complement of the Man- 
delbrot set M\ associated 
with the family of IFS 
{C; w\ - Xz + 1, w 2 = 
X*z - 1}, together with 
some of the corresponding 
fractals. Notice how these 
have subsets of points 
that lie on straight lines, 
like the local structure of 
dM\. 



Let us try to obtain a better map of this family of attractors. In order to do so we 
begin by denning an extension of the associated shift dynamical system, for each 
\eP\Mi. Let A(X) denote the attractor of the IFS. One can prove that A(X) is 
symmetric about the y-axis. Hence A(X) e Mi if and only if A(X) intersects the 
y-axis. Define A : C C 



hiz) = 



Wy 1 (Z) 

■\z) 



w> 



if Rez > 0; 
if Rez < 0. 



Then when X is such that A(X) is disconnected, {A(X); A) is the shift dynamical 
system associated with the IFS; {C; A) is an extension of the shift dynamical system 
to all of C; and A(X) is the "repelling set" of {C; A). This system can be used to 
compute images of A(X) in the just-touching and totally disconnected cases, using 
the Escape Time Algorithm, as discussed in Chapter VII, section 1. 

We make a map of the family of dynamical systems {R 2 ; /*}, A. e P. To do this we 
use the following algorithm, which was illustrated in Program 8.3.1. The algorithm 
applies to any family o/ dynamical systems {R 2 ; A) that possesses a "repelling set" 
A(X), and such that P is a two-dimensional parameter space with a nice classical 
shape, such as a square or a disk. 
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Algorithm 4.1 Method for Coloring Parameter Space According to 
an Escape Time. 

(i) Choose a positive integer, numits, corresponding to the amount of computa- 
tion one is able to do. Fix a point Q e R 2 such that Q e A(k) for some, but 
not all, k G P. 

(ii) Fix a ball BcK 2 such that A(k) C B for alike P. Define an escape region 
tobeV=R 2 \ B. 

(Hi) Represent the parameter space P by an array of pixels. Carry out the fol- 
lowing step for each k in the array, 
(iv) Compute {f™(Q) : n = 0, 1, 2, 3, . . . , numits}. Color the pixel k according 
to the least value of n such that f™(Q) e V. // the computed piece of the 
orbit does not intersect V, color the pixel black. 

The result of applying this algorithm to the dynamical system defined above, with 
Q = O, is illustrated in Figures VIII.239 and VIII.240 (a)-(g) and Color Plates 17 
and 18. 

Figure VIII.239 contains four different regions. The first is a neighborhood of O, 
surrounded by almost concentric bands of black, gray, and white. The location of 
this region is roughly the same as that of P \ Mi, which corresponds to totally dis- 
connected and just-touching attractors. The second region is the grainy area, which 

(z-l)A ifRez>0; 
(z + l)A ifRez<0. 



Figure VIII.239. A 

map of the family of dy- 
namical systems {C; f k }, 
where 



Mz) = 



The parameter space is 
P = {k e C; 0 < ki < 
1,0 < k 2 <0.75}. The 
map is obtained by ap- 
plying Algorithm 8.4.1. 
Pixels are shaded accord- 
ing to the "escape time" 
of a point O eR 2 . The 
exciting places where the 
interesting fractals are to 
be found are not within 
the solid bands of black, 
gray, or white, but within 
the foggy coastline. This 
coastline is itself a fractal 
object, revealing infinite 
complexity under mag- 
nification. In it one finds 
approximate pictures of 
some of the connected 
and "almost connected" 
repelling sets of the dy- 
namical system. Why are 
they there? 
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Figure VIII. 240. A se- 
quence of zooms on a 
piece of the foggy coast- 
line in Figure VIII.239. 
The window coordinates 
of the highest power 
zoom are 0.4123 < \ { < 
0.4139,0.6208 < X 2 < 
0.6223. Can you find 
where each picture lies 
within the one that pre- 
cedes it? 



Figure VIII.240. (b) 
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Figure VIII.240. (c) 



Figure VIII.240. (d) 
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Figure VIII.240. (f) 
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Figure VIII.240. (g) 




we refer to as the foggy coastline. Here, upon magnification, one finds complex 
geometrical structures. An example is illustrated in the sequence of zooms in Fig- 
ure VIII.240 (a)-(g). The structures appear to be subtly different from one another. 
Early experiments show that if A. is chosen in the vicinity of one of these structures, 
then images of the "repelling set" of the dynamical system {IR 2 ; fx}, computed us- 
ing the Escape Time Algorithm, contains similar structures. An example of such 
an image is shown in Figure VIII.241. The third region, at the lower right in Fig- 
ure VIII.239, is made up of closed contours of black, grey, and white. Here the map 
conveys little information about the family of dynamical systems. To obtain infor- 
mation in this region one should examine the orbits of a point Q, different from 0. 
The fourth region, the outer white area in Figure VIII.239, corresponds to dynamical 
systems for which the orbit of O does not escape. It is likely that for A. in this region, 
the "repelling set" of the dynamical system possesses an interior. 

Our new maps, such as Figure VIII.239, can provide information about the family 

oflFS 

{C; «,,(*) = Xz + 1, w 2 (z) = X*z - 1}, P = {A e C : |X| < 1} 
in the vicinity of the boundary of the Mandelbrot set. For A. € dM\ the attractor of 
the IFS is the same as the repelling set of the dynamical system. For A. close to dMi 
the attractor of the IFS "looks like" the repelling set of the dynamical system. 

Figure VIII.242 shows a transverse section through the anther of a lily. We include 
it because some of the structures in Figure VIII.240 (a)-(g) are reminiscent of cells. 
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Figure VIII.241. Im- 
age of the repelling set 
for one of the family 
of the dynamical sys- 
tems whose parameter 
space was mapped in Fig- 
ure VIII.239. This image 
corresponds to a value of 
A. that lies within the high- 
est power zoom in Fig- 
ure VIII.240. Notice how 
the objects here resemble 
those in the corresponding 
position in the parameter 
space. 




Figure VIII.242. Lon 

gitudinal section through 
part of the stigma of a 
lily, showing germinating 
pollen-grains, h pappil- 
lae of stigma; p.g., pollen 
grains; t, pollen tubes. 
Highly magnified. (After 
DodelPort, [Scott 1917].) 
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Algorithm 1 in section 4 can be applied to families of dynamical systems of the 
type described in Theorem 4.1 in Chapter VII. For example, let {R 2 ; A), where 
X e P = m C R 2 , denote a family of dynamical systems. Let X C R 2 be compact. Let 
f k : X R 2 be continuous and such that /(X) D X. Then f k possesses an invariant 
set A (A.) e H(X), given by 

AW = n„v; H) (X). 

A (A.) is the set of points whose orbits do not escape from X. The set of points in P 
corresponding to which orbit of Q does not escape from X is 

M(Q) = {XeP:QeA(X)}. 

We conclude this chapter by giving an "explanation" of how family resemblances 
can happen between structures that occur on the boundary of M(Q) and the sets 
A(X). (1) Suppose that A(X) is a set in R 2 that looks like a map of Great Britain, 
translated by X. Then what does M(Q) look like? It looks like a map of Great 
Britain. (2) Suppose that A(X) is a set that looks like a map of Great Britain at time 
A.i, translated by X. We picture the set A(X) varying slowly, perhaps its boundary 
changing continuously in the Hausdorff metric as X varies. Now A(X) looks like a 
deformed map of Great Britain. The local coves and inlets will be accurate represen- 
tations of those coves at about the time A.i to which they correspond in the parameter 
space map. That is, the boundary of M(Q) will consist of neighboring bays and 
inlets at different times stitched together. It will be a map that is microscopically ac- 
curate (at some time) and globally inaccurate. (3) Now pretend in addition that the 
coastline of Great Britain is self-similar at each time X { . That is, imagine that lit- 
tle bays look like whole chunks of the coastline, at a given instant. Now what will 
M(Q) look like? At a given microscopic location on the boundary, magnified enor- 
mously, we will see a picture of a whole chunk of the coastline of Great Britain, at 
that instant. (4) Now imagine that for some values of X, Great Britain, in the dis- 
tant future, is totally disconnected, reduced to grains of isolated sand. It is unlikely 
that those values of X belong to M(Q). As X varies in a region of parameter space 
for which A(X) is totally disconnected, it is not probable that Q e A(X). In these 
regions we would expect M(Q) to be totally disconnected. 

The families of sets {A(X) eX.Xe P} considered in this chapter broadly fit into 
the description in the preceding paragraph. Both P and X are two-dimensional. The 
sets A(X) are derived from transformations that behave locally like similitudes. For 
each X e P, A(X) is either connected or totally disconnected. Finally, the sets A(X) 
and their boundaries appear to depend continuously on X. 

Examples & Exercises 

4.1 . In the above section we applied Algorithm 1 in section 4, with Q = (0, 0), to 
compute a map of the family of dynamical systems 
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fx(z) = 



(z - DA 
(z + DA* 



if Re z > 0, 
if Re z < 0. 



The resulting map was shown in Figure VIII.239. This map contains an unexplored 
region. Repeat the computation, but with (a) Q = 0.5, and (b) Q = -0.5, to obtain 
information about the unexplored region. 

4.2. In this example we consider the family of dynamical systems {C; fx}, where 



The parameter space is X e P = [X e C : 0 < \X\ < 1}. This family is related to the 
family of IFS 



Let A(X) denote the attractor of the IFS and let A (A.) denote the "repelling set" 
associated with the dynamical system. Let 

S = {Xe P : the line X 2 x -X l y=0 separates wi(A(X)) and w 2 (A(X))}. 

IfXeS then [A(X); f x } is the shift dynamical system associated with the IFS, and 
A(X) = A(X). Even when X g S we expect there to be similarities between A(X) and 
A(X). 

In Figures VIII.243, VIII.244, and VIII.245 and Color Plates 19 and 20 we show 
some results of applying Algorithm 1 in section 4 to the dynamical system {C; fx}. 

In Figure VIII.243, the outer white region represents systems for which the orbit 
of the point O does not diverge, and probably corresponds to "repelling sets" with 
nonempty interiors. The inner region, defined by the patchwork of gray, black, and 
white sections, bounded by line segments, represents systems for which the orbit 
of O diverges and corresponds to totally disconnected "repelling sets." The grainy 
gray area is the interesting region. This is the "coastline"; it is itself a fractal object, 
revealing infinite complexity under magnification. Figures VIII.244 and VIII.245 
show magnifications at two places on the coastline. The grainy areas revealed by 
magnification resemble pictures of the repelling set of the dynamical system at the 
corresponding values of X. 

4.3. This exercise refers to the family of dynamical systems {C; z 2 - X}. Use Algo- 
rithm 1 in section 4 with -0.25 < X x < 2, -1 < X 2 < 1, and Q = (0.5, 0.5) to make 
a picture of the "Mandelbrot Set" M (0.5, 0.5). An example of such a set, for a dif- 
ferent choice of Q, is shown in Figure VIII.246. 



fxiz) = 



(z - DA 
(z + DA 



if X 2 x — X\y > 0, 
if X 2 x — X 2 y < 0. 



{C; w x {z) = Xz + 1, w 2 {z) = Xz- 1}. 



4 



Figure VIII.243. A 

map of the family of dy- 
namical systems described 
in example 4.2, computed 
using Algorithm 1 in sec- 
tion 4. The parameter 
space is P = [k e C; 0 < 
k x < 1,0 < k 2 < 1}. The 
gray grainy area is the in- 
teresting region. This is 
the "coastline"; it is itself 
a fractal object, revealing 
infinite complexity under 
magnification. 



Figure VIII. 244. Zoom 
on a small piece of 
the foggy area in Fig- 
ure VIII.242. In it one 
finds grainy areas that re- 
semble the repelling sets 
of the corresponding dy- 
namical systems. At what 
value of k does one find 
them? At the value of k 
in the map where the pic- 
ture you are interested in 
occurs. 
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Figure VIII. 245. Zoom 
on a small piece of 
the foggy area in Fig- 
ure VIII. 242. The grainy 
areas in this picture here 
have different shapes from 
those in Figure VIII.244. 



Figure VI 1 1. 246. A 

"Mandelbrot set" M(zo) 
associated with the family 
of dynamical systems 
{C; z 2 — A}. This was 
computed using escape 
times of orbits of the point 
z = Zo different from the 
critical point, z = 0. 



-4- 



Chapter IX 

Measures on Fractals 

1 Introduction to Invariant Measures on Fractals 

In this section we give an intuitive introduction to measures. We focus on measures 
that arise from iterated function systems in 1R 2 . 

In Chapter III, section 8 we introduced the Random Iteration Algorithm. This 
algorithm is a means for computing the attractor of a hyperbolic IFS in 1R 2 . In order 
to run the algorithm one needs a set of probabilities, in addition to the IFS. 

Definition 1.1 An iterated function system with probabilities consists of an IFS 

{X; wi, w 2 , . . • , w N } 

together with an ordered set of numbers {p\, P2, • ■ • , Pn}, such that 

Pi + Pi + P3 H 1- Pn = 1 and pi > 0 for i = 1, 2, . . . , N. 

The probability /?, is associated with the transformation tu f . The nomenclature 
"IFS with probabilities" is used for "iterated function system with probabilites." The 
full notation for such an IFS is 

{X; w\, w 2 , w N ; p\, p 2 , . . . , Pn}- 

Explicit reference to the probabilities may be suppressed. 
An example of an IFS with probabilities is 

{€; wi(z), w 2 (z), w 3 (z), w 4 (z); 0.1, 0.2, 0.3, 0.4}, 

where 

w x (z) = 0.5z, w 2 (z) = 0.5z + 0.5, 
w 3 (z) = 0.5z + (0.5)/, w 4 (z) = 0.5z + 0.5 + (0.5)/. 
It can be represented by the IFS code in Table IX. 1. The attractor is the filled square 
■, with corners at (0, 0), (1, 0), (1, 1), and (0, 1). 

Here is how the Random Iteration Algorithm proceeds in the present case. An 
initial point, zo e C, is chosen. One of the transformations is selected "at random" 
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Table IX. 1. IFS code for a measure on ■. 



w 


a 


b 


c 


d 


e 


/ 


P 


1 


0.5 


0 


0 


0.5 


1 


1 


0.1 


2 


0.5 


0 


0 


0.5 


50 


1 


0.2 


3 


0.5 


0 


0 


0.5 


1 


50 


0.3 


4 


0.5 


0 


0 


0.5 


50 


50 


0.4 



from the set {w u w 2 , w 3 , w 4 ). The probability that w t is selected is Pi , for , - 
1 2 3 4 The selected transformation is applied to zo to produce a new point Zl e 
C Again a transformation is selected, in the same manner, independently of the 
previous choice, and applied to Zl to produce a new point z 2 . The process is repea ed 
anumber of times, resulting in a finite sequence of points {z n : n = 1, 2, . , numits), 
where numits is a positive integer. For simplicity, we assume that zo e Then, since 
w(u) c ■ for i = 1, 2, 3, 4, the "orbit" {z„ : n = 1, 2, . . . .nM/nitt} lies in - 

Consider what happens when we apply the algorithm to the IFS 1 code m Table 
IX 1 If the number of iterations is sufficiently large, a picture of ■ will be the 
result. That is, every pixel corresponding to . is visited by the "orbit z n : n - 
1 2 numits}. The rate at which a picture of . is produced depends on the prob- 
abilities. If numits = 10, 000, then we expect that because the images of - are just- 
touching, 

the number of computed points in wi(m) « 1000, 
the number of computed points in w 2 (m) « 1000, 
the number of computed points in w 3 (m) « 1000, 
the number of computed points in w 4 (m) « 1000. 
^ese estimates are supported by Figure IX.247, which shows the result of nrnning 
a modified version of Program 2 in Chapter III, with the IFS code m Table IX. 1 , and 

T Filure^x'MS we show the result of running a modified version of Program 2 
in Chapter III, for the IFS code in Table IX. 1, with various choices for the proba- 
bilities In each case we have halted the program after a relatively small number of 
iterations, to stop the image from becoming "saturated." The results are dtversete^ 
tures. in each case the attractor of the IFS is the same set, .. However ^ the pom s 
produced by the Random Iteration Algorithm "rain down' on . with d.ffe ent fre- 
quencies at different places. Places where the "rainfall" is highest appear darke 
or "more dense" than those places where the "rainfall" is lower. In the end all places 
on the attractor get wet. 

The pictures in Figure IX.248 (.MO suggest a wonderful uta. 
that associated with an IFS with probabilities there is a unique denstty on the 
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Figure IX.247. The 

Random Iteration Algo- 
rithm, Program 1 in Chap- 
ter III, is applied to the 
IFS code in Table IX. 1, 
with numits = 100,000. 
Verify that the number of 
points that lie in u>, (■) is 
approximately (numits) p,, 
for / = 1, 2, 3,4. 




attractor of the IFS. The Random Iteration Algorithm gives one a glimpse of this 
"density," but one loses sight of it as the number of iterations is increased. This is 
true, and much more as well! As we will see, the "density" is so beautiful that we 
need a new mathematical concept to describe it. The concept is that of a measure. 
Measures can be used to describe intricate distributions of "mass" on metric spaces. 
They are introduced formally further on in this chapter. The present section provides 
an intuitive understanding of what measures are and of how an interesting class of 
measures arises from IFS's with probabilities. 

As a second example, consider the IFS with probabilities 

{C; w\ (z) = 0.5z + 24 + 24i, w 2 (z) = 0.5z + 24/, w 3 (z) = 0.5z; 0.25, 0.25, 0.5}. 

The attractor is a Sierpinski triangle A. The probability associated with u; 3 is twice 
that associated with either w\ or w 2 . In Figure IX.249 we show the result of applying 
the Random Iteration Algorithm, with these probabilities, to compute 10,000 points 
belonging to A. There appear to be different "densities" at different places on A. For 
example, iy 3 (A) appears to have more "mass" than either wi(A) or iy 2 (A). 

In Figure IX. 250 we show the result of applying the Random Iteration Algorithm 
to another IFS with probabilities, for three different sets of probabilities. The IFS 
is {R 2 ; w\, w 2 , u>2, w 4 }, where wi is an affine transformation for / = 1, 2, 3, 4. The 
attractor of the IFS is a leaf-like subset of R 2 . In each case we see a different pattern 
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Figure IX.248. The 

Random Iteration Algo- 
rithm is applied to the IFS 
code in Table IX. 1, but 
with various different sets 
of probabilities. The result 
is that points rain down on 
the attractor of the IFS at 
different rates at different 
places. What we are see- 
ing are the faint traces of 
wonderful mathematical 
entities called measures. 
These are the true fractals. 
Their supports, the attrac- 
tors of IFS, are merely sets 
upon which the measures 
live. 
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Figure IX.248. (c) 




of "mass" on the attractor of the IFS. It appears that each "density" is itself a fractal 
object. 

Examples & Exercises 

1.1. Carry out the following numerical experiment. Apply the Random Iteration 
Algorithm to the IFS code in Table IX. 1, for numits = 1000, 2000, 3000, .... In 
each case record the number, Af, of computed points that land in B — {(x, y) e 
R 2 : (x — l) 2 + (y — l) 2 < 1}, and make a table of your results. Verify that the ratio 
Af /numits appears to approach a constant. 

1 .2. Repeat the computergraphical experiment that produced Figure IX.247. Verify 
that you obtain "similar-looking" output to that shown in Figure IX.247, even though 
you (probably) use a different random-number sequence. 

1 .3. The Random Iteration Algorithm is used to compute 100,000 points belonging 
to ■, using the IFS code in Table IX. 1. How many of these points, do you expect, 
would belong to w\ o 103(B)? Why? 

Let (X, d) be a complete metric space. Let {X; w\, . . . , p\, . . . , p N ) be an 
IFS with probabilities. Let A denote the attractor of the IFS. Then there exists a 
thing called the invariant measure of the IFS, which we denote here by (jl. fi assigns 
"mass" to many subsets of X. For example, ix(A) = 1 and fi(0) = 0. That is, the 
"mass" of the attractor is one unit, and the "mass" of the empty set is zero. Also 
^i(X) = 1, which says that the whole space has the same "mass" as the attractor of 
the IFS; the "mass" is located on the attractor. 
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Figure IX.249. The 

A« 9 .ft Random Iteration Algo- 

ritnm is used t0 com P ute 

<6^ " £k an image of the Sierpinski 

.^ KA %. /f V "ii>. triangle A. The probabil- 

ity associated with u> 3 is 

twice that associated with 
*\ jfe. A, /\ to, orui 2 . One thousand 

^£ ' ' " ^ ' ' '* '" ^ points have been com- 

puted. The result is that 
»*■ , ^',>. .4 A A a' S"'*- it) 3 (A) appears denser than 

^ it),(A) or it) 2 (A). This ap- 

A 4b> %> pearance is lost when the 

A. J >■ A number of iterations is in- 

S%. creased. We are led to the 

*h ,yh /\ /\ ,'*}. idea of a "mass" or mea- 

' " "*" ^ A, " sure that is supported on 

fi\ \ ^ 7t a" "X the fractal. 



Not all subsets of X have a "mass" assigned to them. The subsets of X that do 
have a "mass" are called the Borel subsets of X, denoted by B(X). The Borel subsets 
of X include the compact nonempty subsets of X, so that H(X) C B(X). Also, if O is 
an open subset of X, then O e B{X). So there are plenty of sets that have "mass." Let 
B denote a closed ball in X. Here is how to calculate the "mass" of the ball, /z(fl). 
Apply the Random Iteration Algorithm to the IFS with probabilities, to produce a 
sequence of points {z w }^l 0 - Let 

M(B, n) = number of points in {zo, zu Zi, Z3. • • • . z n ) H B, for n = 0, 1, 2, 

Then, almost always, 

MB) = lim ' ; . 

«^oo [ (n + 1) J 

That is, the "mass" of the ball B is the proportion of points, produced by the Random 
Iteration Algorithm, which land in B. (To be precise we also have to require that the 
"mass" of the boundary of B is zero; see Corollary 7.1.) 

By now you should be bursting with questions. How do we know that this formula 
"almost always" gives the same answer? What are Borel sets? Why don't all sets 
have "mass"? Welcome Jo measure theory! 

As an example, we evaluate the measure of some subsets of C, for the IFS with 
probabilities 
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Figure IX.250. The Random Iteration Algorithm is used to compute an image of a leaf. 
Different sets of probabilities lead to different distributions of "mass" on the leaf. 



{C; wi (z) = 0.5z, w 2 (z) = 0.5z + (0.5)/, w 3 (z) = 0.5z + 0.5; 0.33, 0.33, 0.34}. 

The attractor is a Sierpinski triangle A with vertices at 0, i, and 1. We compute the 
measures of the following sets: 



Bi = 


{Z€C 


:\z\ 


<0.5} 


B 2 = 


{zgC 


: |z 


- (0.5+0.501 <0.2} 


B 3 = 


{Z€C 


■:\z 


- (0.5 +0.5/)| <0.5} 


B 4 = 


{Z€C 


: |z 


-(2 + 0l<V2}. 



The results are presented in Table IX.2. 

Figure IX.25 1 illustrates the ideas introduced here. 

Examples & Exercises 

1 .4. Explain why fi(B 4 ) ^ 0 in Table IX.2. 

1.5. What value, approximately, would have been obtained for n(B\) in Table 
IX.2, if the probabilities on the three maps had been p\ = 0.275, p 2 = 0.125, and 
p 3 =0.5? 
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Table IX.2. The measures of some subsets of A are computed by random iteration. 



N{B x ,n)/n N{B 2 ,n)/n X(B 3 ,h)/n M{B A ,n)/n 



5,000 


0.3313 


0.1036 


0.6385 


0.0004 


10,000 


0.3314 


0.1050 


0.6500 


0.0002 


15,000 


0.3323 


0.1041 


0.6512 


0.0001 


20,000 


0.3330 


0.1030 


0.6525 


0.0000 


50,000 


0.3326 


0.1041 


0.6527 


0.0000 


100,000 


0.3325 


0.1054 


0.6497 


0.0000 




The "mass" of B is the 
proportion of time 
spent in B 



Random Iteration Algorithm produces 
^JS». a dancing point. 




Figure IX.251. Di- 
agram of the Random 
Iteration Algorithm run- 
ning, and a dancing point 
coming and going from 
the ball B. The "mass" 
or measure of the ball is 
lx{B). It is equal to the 
proportion of points that 
land in B. 



1 .6. Why, do you think, is the phrase "almost always" written in connection with 
the formula for ii(B), given above? 



2 Fields and Sigma-Fields 

Definition 2.1 Let Xbe a space. Let T denote a nonempty class of subsets of a 
space X, such that 

i 

(1) A,B AU B ef; 

(2) Aef^X\Aef. 
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Then T is called a field. 

(In exercise 2.12 you will be asked to prove that X e T '.) 

Theorem 2. 1 Let Xbe a space. Let Q be a nonempty set of subsets ofX. Let T 
be the set of subsets ofX which can be built up from finitely many sets in Q using the 
operations of union, intersection, and complementation with respect to X. Then T is 
a field. 

Proof Elements of T consist of sets such as 

X \ (((X \ (Gi U G 2 )) n G 3 ) U (G 5 n G 6 )), 

where G u G 2 , G 3 , G 3 , . . . denote elements of Q. That is, T is made of all those 
sets that can be expressed using a finite chain of parentheses, \, U, n, elements of 
Q, and X. (In fact, using de Morgan's laws one can prove that it is not necessary to 
use the intersection operation.) If we form the union of any two such expressions 
we obtain another one. Similarly, if we form the complement of such an expression 
with respect to X, we obtain another such expression. So conditions (i) and (ii) in 
Definition 2.1 are satisfied. This completes the proof. 

Definition 2.2 The field referred to in Theorem 2.1 is called the field generated 
byQ. 

Examples & Exercises 

2.1. Let X be a space and let A c X. Then T = {X, A, X \ A, 0} is a field. 

2.2. Let X be the set of all leaves on a certain tree and let T be the set of all subsets 
of X. Then T is a field. Let A denote the set of all the leaves on the lowest branch of 
the tree. Then AeT. Prove that T is generated by the leaves. 

2.3. Let X = [0, 1] c R. Let Q denote the set of all subintervals (open, closed, half- 
open) of [0, 1]. Let T denote the field generated by Q. Examples of members of T 
are [0.5, 0.6) U (0.7, 0.81); [0, 1]; [1, 1]; and (I, 1) U (±, I) U • • • U (^, ±). Show 
that 

*feW)-G-0«4;K>~ 

is a subset of X but it is not a member of T. 

2.4. Let X = ■ c R 2 . Let Q denote the set of closed rectangles contained in X, 
whose sides are parallel to the coordinate axes and whose corners have rational 
coordinates. Let T denote the field generated by Q. An example of an element of 
T is 

((■ \ ((■ \ R x ) U R 2 ) n R 3 ) U (R 4 n (■ \ R 5 )), 

where R u R 2 , R 3 , R4, and R 5 are rectangles in Q. Let S e T. Prove that the area of 
S is a rational number. Deduce that T does not contain the ball B(0, \) = {( x ,y)e 
m: x 2 + y 2 < 1}. 
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2.5. Let X denote the set of pixels corresponding to a certain computer graphics 
display device. The set of all monochrome images that can be produced on this 
device forms a field. Figure IX.252 shows an example of a small field of subsets of 
X. It is generated by the pair of images, G x and G 2 , in the middle of the second row, 
together with the set X. X is represented by the black rectangle. The empty set is 
represented by a blank screen. Find formulas for all of the images in Figure IX.252, 
in terms of G\, G 2 , and X. 

2.6. Let E denote the code space on two symbols 1 and 2. Let n e { 1 , 2, 3, . . .} and 
g,-G {1,2} for / = 1,2, ...,rc.Let 

COi, e 2 , . . . , e n ) = {x e E : x t = e, for i = 1, 2, . . . , n}. 

Any subset of E that can be written in this form is called a cylinder subset of E. Let 
T denote the field generated by the cylinder subsets of E. Find a subset of E that is 
not in T. 

2.7. Let X be a space. Let T denote the set of all subsets of X. The customary 
notation for this field is T = 2 X . Show that T is a field. 

Definition 2.3 L$t Tbea field such that 

Ai e T for / e I, 2, 3, . . . A,- e T. 
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Then T is called a a -field (sigma-field). 

Given any field, there always is a minimal, or smallest, a -field which contains it. 

Theorem 2.2 Let Xbea space and let Q be a set of subsets ofX. Let {F a : a e 
1} denote the set of all a -fields on X which contain Q. Then T — ^ a T a is a a -field. 

Proof Note that there is at least one a -field that contains Q, namely 2 X , the field 
consisting of all subsets of X. We have to show that D a J" a is a a -field if each T a is a 
a -field that contains Q. Suppose that A,- e n a F a ; then, for each a, A,- is an element 
of the a -field T a and so U^,A, e T a . Suppose A e n a J" a ; then, for each a,AeT a 
and so X \ A e T a . Hence X \ A e n a J" a . This completes the proof. 

Definition 2.4 Let Qbea set of subsets of a space X. The minimal a -field which 
contains Q, defined in Theorem 2.2, is called the a -field generated by Q. 

Definition 2.5 Let (X, d) be a metric space. Let B denote the a -field generated 
by the open subsets ofX.B is called the Borel field associated with the metric space. 
An element ofB is called a Borel subset ofX. 

The following theorem gives the flavor of ways in which the Borel field can be 
generated. 

Theorem 2.3 Let (X, d) be a compact metric space. Then the associated Borel 
field B is generated by a countable set of balls. 

Proof We prove a more general result first. Let Q = {b n cX:n = 1,2, 3, 
...;b n open } be a countable base for the open subsets of X. That is, every open 
set in X can be written as a union of sets in Q. Then B is generated by Q. To see 
this, let B denote the cr -field generated by Q. Then BcB because Q is contained in 
the set of open subsets of X. On the other hand, B cB because B contains all the 
generators of B. Hence B = B. 

It remains to construct a countable base for the open subsets of X using balls. For 
R > 0 let 

B(x,R) = {y eX:d(x,y) < R}. 

Let n be a positive integer. Then X = U x€X B(x, £). Hence {B(x, : x e X} is an 
open covering of X. Since X is compact it contains a finite subcovering {B(x%\ : 
m = 1,2, ... , M{ri)} for some integer M(n). We claim that 

V = {B(x%\ -).m = l,2,..., M{n); n = 1, 2, 3, . . .} 
n 

is a countable base for the open subsets of X. Let O be an open subset of X, and let 
x e O. Then there is an open ball, of radius R > 0, such that B(x, R) c O. Let n be 
large enough that \ < f . Then there is m e {1, 2, . . . , M(ri)) so that x is in the ball 
B(x in) -), and this ball is contained in O. Each x in O is contained in such a ball, 
belonging to V. Hence V is indeed a countable base for the open subsets of X. This 
completes the proof. 
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Examples & Exercises 

2.8. Let B denote the a -field generated by the field in exercise 2.4. Then B contains 
the ball B(0, 1). Similarly it contains all balls in ■ c K 2 . Show that B is the Borel 
field associated with ( ■, Manhattan). 

2.9. Let E denote the code space on the two symbols {0, 1}. Show that the Borel 
field associated with (E, code space metric) is generated by the cylinder subsets of 
E, defined in exercise 2.5. 

2.10. Let AcK 2 denote a Sierpinski triangle. Let Q denote the set of connected 
components of R 2 \ A. Let T denote the a -field generated by Q. Show that T is 
contained in, but not equal to, the Borel field associated with (R 2 , Euclidean). 

2. 1 1 . Let X be a space and let Q be a set of subsets of X. Let T\ be the field 
generated by Q, let T2 be the ct -field generated by Q, and let T3 be the <r -field 
generated by T\. Prove that Ti = T 2 . 

2.1 2. Let T be a field of subsets of a space X. Prove that Xef. 



3 Measures 

A measure is defined on a field. Each member of the field is assigned a nonnegative 
real number, which tells us its "mass." 

Definition 3.1 A measure \x, on a field T , is a real nonnegative function \x : 
T -> [0, 00) c R, such that whenever A, G T for i = 1 , 2, 3, . . ., with A t n A } ■■ = 0 
for i ^ j and U^jA,- G T, we have 

00 

/*(U,~iA,-) = X>(A,-). 

1=1 

(In other texts a measure as defined here is usually referred to as a finite measure.) 

Definition 3.2 Let (X, d) be a metric space. Let B denote the Borel subsets of 
X. Let fi be a measure on B. Then /x is called a Borel measure. 

Some basic properties of measures are summarized below. 

Theorem 3. 1 Let T be a field and let \x \T -± R be a measure. Then 

(1) IfB D A, then fi(B) = fi(B \ A) + fi(B),for A, B G T; 

(2) IfB D A, then fi(B) > /x(A); 

(3) /x(0) = 0; 

(4) If Ai G Tfor i = 1, 2, 3, . . ., and Uf^A,- g T, then /x (Uf^A,) < 

(5) If {At g T\ obeys A x c A 2 C A 3 C . . ., and if U^jA/ g T, then /x(A f ) 
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(6) If {Ai g T) obeys A x D A 2 D A 3 D . . .,.and if n£,A,- e T, then fi(A t ) -+ 

Proof [Rudin 1966] Theorem 1.19, p. 17. These are fun to prove for yourself! 

We are concerned with measures on compact subsets of metric spaces such as 
(K 2 , Euclidean). The natural underlying a -field is the Borel field, generated by the 
open subsets of the metric space. The following theorem allows us to work with the 
restriction of the measure to any field that generates the a -field. 

Theorem 3.2 [Caratheodory] Let [jl denote a measure on a field T. Let T 
denote the o -field generated by T . Then there exists a unique measure \xon T such 
that fji(A) = pL(A)for all A € T. 

Sketch of proof The proof can be found in most books on measure theory; see 
[Eisen 1969] Theorem 5, p. 180, Chapter 6, for example. First fi is used to define an 
"outer measure" fi° on the set of subsets of X. fjP is defined by 

fi°(A) = inf | v(A n ) : A c U~ , A H , A n e FVn e 2+ J . 

/a 0 is not usually a measure. However, one can show that the class of subsets A 
of X such that — this was Caratheodory 's smart idea — 

fi°(E) = fJ°(A HE) + At°((X \A)DE) for all £e2 x 

is a a -field that contains T . One can also show that fi° is a measure on ^ r0 . Note that 
•F 0 D T. fi is defined by restricting fi° to T. Finally one shows that this extension of 
fjL to T is unique. This completes the sketch. 

In the above sketch we have discovered how to evaluate the extended measure (i 
in terms of its values on the original field. 

Theorem 3.3 Let a measure /a on a field T be extended to a measure /t on the 
minimal o -field T that contains T . Then, for all A ^ T , 

A(A) = inf |]P fi(A n ) : A c U~ x A n , A n e FVn = 1, 2, . . . j . 

Examples & Exercises 

3. 1 . Consider the field T = {X, A , X \ A , 0}, where A / X and A / 0. A measure 
fi : T -+ D& is defined by fi(X) = 7.2, fi(A) = 3.5, fi(X \ A) = 3.7, and ai(0) = 
0. T is also a a -field. The extension of the measure promised by Caratheodory 's 
theorem is just the measure itself. 

3.2. Let T be the field made of sets of leaves on a certain tree, at a certain instant 
in time, and let /jl(A) be the number of aphids on all the leaves in A e T. Then [i is 
a measure on a finite a -field. 
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3.3. Let X = [0, 1] C R. Let T be the field generated by the set of subintervals 
of [0, 1]. Let a, b g [0, 1] and define p((a, b)) = p([a, b])=b- a, for a < b; and 
more generally let 

/x (element of T) = sum of lengths of subintervals which comprise the element. 

Show that ^ is a measure on T. The a -field T generated by T is the Borel field for 
([0, 1], Euclidean). Show that 5 = {x g [0, 1] : x is a rational number } belongs to T 
but not to T. Evaluate p(S), where p is the extension of p to T. 

3.4. Let X = E , the code space on the two symbols 1 and 2. Let T denote the field 
generated by the cylinder subsets of E, as defined in exercise 2.5. Let 0 < p\ < 1 
and p2 = 1 — p\. Define 

p(C(e\, e 2 , e n )) = p ei Pe 2 ■ ■ ■ Pe n , 

for each cylinder subset C{e x , e 2 , . . . , e n ) of E. Show how p. can be defined on the 
other elements of T in such a way as to provide a measure on T. Evaluate 

n({x G E :x 7 = 1}) andAt(E). 

Extend T to the field T generated by T, and correspondingly extend p, to /t- Show 
that 

S = {x g E : jCodd = 1} e^" 

and evaluate 

3.5. This example takes place in the metric space ([0, 1], Euclidean). Consider the 
IFS with probabilities 

1 12, 
{[0, 1]; wi(x) = -x, w 2 {x) = 3* + 3' Pi> Pd- 

Let T denote the field generated by the set of intervals that can be expressed in the 
form 

tUgj o w e2 o . . . o Wg n ([0, 1]), 

where n g {1, 2, . . .} and e t g {1, 2} for each i = 1, 2, . . . , n. Let 0 < p\ < 1 and 
p 2 = 1 - p x . Show that one can define a measure on T so that, for every such 
interval, 

p(W ei OW e2 0---0 W en ([0, 1])) = p ei Pe 2 • ■ ■ Pe n - 

Let A denote the attractor of the IFS. Evaluate p(A), p(X \ A), and p([\, f ]). 

3.6. What happens in exercise 3.5 if the IFS is replaced by 

{[0, 1]; iwi(x) = i*, w 2 (x) = + i; pi, PiV- 

For what value of p\ is the extension of the measure to the a -field generated by T 
the same as the Borel measure defined in exercise 3.3? 
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Definition 3.3 Let (X, d) be a metric space, and let ixbe a Borel measure. Then 
the support of /z is the set of points x e X such that /z(fi(x, e)) > 0 for all e > 0, 
where B(x, e) = {y e X : d(y, x) < e}. 

The support of a measure is the set on which the measure lives. The following is 
an easy exercise. 

Theorem 3.4 Let (X, d) be a metric space, and let ixbe a Borel measure. Then 
the support of /z is closed. 

Examples & Exercises 

3.7. Let (X, d) be a compact metric space and let /z be a Borel measure on X such 
that /z(X) / 0. Show that the support of /z belongs to H(X), the space of nonempty 
compact subsets of X. 

3.8. Prove the following. "Let /z be a measure on a cr-field T, and let T be the 
class of all sets of the form AU B where A e T and B is a subset of a set of measure 
zero. Then T is a cr-field and the function Jl : T -* R defined by Jl(A U B) — ji(A) 
is a measure." The measure Jl referred to here is called the completion of /z. The 
completion of the measure in exercise 3.3 is called the Lebesgue measure on [0, 1]. 



4 Integration 



In the next section we will introduce a remarkable compact metric space. It is a space 
whose points are measures! In order to define the metric on this space we need to 
be able to integrate continuous real-valued functions with respect to measures. Can 
one integrate a continuous function defined on a fractal? How does one evaluate the 
"average" temperature of the coastline of Sweden? Here we learn how to integrate 
functions with respect to measures. Let (X, d) be a compact metric space. Let /z be 
a Borel measure on X. Let / : X -» R be a continuous function. We will explain the 
meaning of integrals such as 

/ f(x)d(x(x). 
Jx 

Definition 4.1 We reserve the notation xa for the characteristic function of a 
set A C X. It is defined by 

1 for x e A, 
0 for x e X \ A. 

A function f : X -> R is called simple if it can be written in the form 

N 

f(x) = ^T t yiXi i (x), 



Xa(x) 
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where N is a positive integer, k e B and y, e R for i = 1, 2, . . . , AT, uj^/,- = X, 
and /,n/ ; =0 fori ^ j. 

The graphs of several simple functions, associated with different spaces, are 
shown in Figures IX.253 and IX.254. 

Definition 4.2 The integral (with respect to fi) of the simple function f in 
Definition 4.1, is 

f C N 

/ f(x)dfjL(x)= / fd f M = j2y i ^ Ii) - 

Jx Jx ,=i 

Examples & Exercises 

4.1. Let / : [0, 1] -> R be a piecewise constant function, with finitely many dis- 
continuities. Show that / is a simple function. Let [i denote the Borel measure on 
[0, 1] such that fi(I) = length of /, when / is a subinterval of [0, 1]. Show that 

f f(x)dx= f f(x)d i i(x), 

J0 J [0,1] 

where the right-hand side denotes the area under the graph of /. 

4.2. This example takes place in the metric space (■, Euclidean). Let Q denote the 
set of rectangular subsets of ■. Let T denote the field generated by Q. Show that 
there is a unique measure /xonf such that fi(A) = area of A, for all A e Q. Notice 
that the <r -field generated by T is precisely the Borel field B associated with (■, 
Euclidean). Let ji denote the extension of fi to B. Let A denote a Sierpinski triangle 
contained in ■. Show that A eB, and 

j XA = A) = 0. 

4.3. This example concerns the IFS with probabilities 

{C; wi(z), w 2 (z), w 3 (z)\ Pi = 0.2, p 2 = 0.3, p 3 = 0.5}, 

where 

wi(z)=0.5z, w 2 (z)=0.5z + (0.5)i, w 3 (z) = 0.5z + 0.5. 

Let A denote the attractor of the IFS, and B the .Borel subsets of (A, Euclidean). Let 
li denote the unique measure on B such that 

/z(A) = l 
fi(w i (A)) = p i fori g{1,2,3}; 
fi(Wi o Wj(A)) = PiPj for i, j e {1, 2, 3}; 



fi(Wi oWj---o w k (A)) = piPj ...p k for i,j,...,ke {1, 2, 3}; 
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Figure IX.253. The 

graph of a simple function 
on a Sierpinski triangle. 
The domain is a Sierpinski 
triangle in the (x, y) 
plane. The function values 
are represented by the z- 
coordinates. 
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Define a simple function on A by 

II for x + iy e A and 1/3 < x < 1, 
-1 for x+iyz A and 0 < x < 2/3. 

Calculate f(z)dii(z), accurate to two decimal places. 

Based on the ideas in section 1 of this chapter, can you guess a method for 
calculating the integral that makes use of the Random Iteration Algorithm? Try it! 

4.4. Show that if a, p e R and /, g are simple functions then af + fig is a simple 
function, and 

a S fdn + P f gdn = I (af + fig)dfi. 

4.5. Black ink is printed to make this page. Let ■ c 0& 2 be a model for the page, 
and represent the ink by means of a Borel measure /x, so that is the mass of ink 
associated with the set A cm. Let A^T denote the smallest Borel set that contains 
all of the letters "a"on the page. Assume that the total mass of ink on the page is one 
unit. Estimate f m XAd/A- 

4.6. Let E denote code space on two symbols {1,2}. B denotes the Borel field 
associated with (E, code space metric). Consider the IFS {E; wi(x) = Ix, w 2 (x) = 
2x; pi = 0.4, p 2 = 0.6}, where "lx" means the string "lxix 2 x 3 . . ." and "2x" means 
the string "2x^2x3 . . .." The attractor of the IFS is E. Let /x denote the unique 
measure on B such that 

fi(Wi oWj---o Wfc(E)) = piPj ... pk for /, j, . . . , k G {1, 2}. 

Define sets A and B in B by 

A = {x g B : x { = 0} and B = {x e B : x 2 = 1}. 
Define / : E -> D& by 

/(*) = + (2.3)*b(*) for alU e E. 
Evaluate the integral 

Definition 4.3 Ler (X, d) be a compact metric space, and let B denote the 
associated Borel field. Let 11 be a Borel measure. A partition ofX is a finite set of 
nonempty Borel sets, {A t e B : i = 1, 2, . . . , M), such that X = Uff^A,, and A t n 
Aj = id for i ^ j. The diameter of the partition is max{sup{d(>, y) :x,y& A t } : i = 
1,2, ...,M}. 

Theorem 4.1 Let (X, d) be a compact metric space. Let B denote the asso- 
ciated Borel field. Let ji be a Borel measure on X. Let f : X -> D& be continuous, 
(i) Let n be a positive integer. Then there exists a partition B n = {A„, m g8 : m = 
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1,2,..., M(n)} of diameter l/n. (ii) Letx n , m e A n , m form = 1, 2, 3, . . ., and define 
a sequence of simple functions by 

M(n) 

fn(x) = f(x n , m )XA n Jx) far H = 1, 2, 3, ... . 

m=l 

Then {/„} converges uniformly to f(x). (Hi) The sequence {f x f n dfi) converges, (iv) 
The value of the limit is independent of the particular sequence of partitions, and of 
the choices of x n , m e A n , m . 

Sketch of proof 

(i) Since X is compact it is possible to cover X by a finite set of closed balls 
of diameter l/n, say b nA , b n , 2 , b n ,M(n)- We can assume that each ball 
contains a point not in any of the other balls. Then define A n ,i =b nt \, and 
A nJ = b n j \ U^jA^fc, for j = 2, 3, . . . , M(n). Then B n = {A„, m e B : m = 
1,2,..., M(n)} is a partition of X of diameter l/n. 

(ii) Let e > 0. / is continuous on a compact space, so it is uniformly continuous. 
It follows that there exists an integer N(e) so that if x, y e X and d(x, y) < 
l/N(e) then |/(jc) - /(y)| < e. It follows that \ f(x) - f n (x)\ < € when 
n > N(e). 

(iii) It is readily proved that {/ x f n d{i\ is a Cauchy sequence. Indeed, for all 
n,m > N(e) we have 

\f fndii- f f m dn\< f \f n - f m \dfi<2eii(X). 

It follows that the sequence converges. 

(iv) Let {/„} be a sequence of simple functions, constructed as above. Then there 
is an integer N(e) such that \f(x) - f n (x)\ < e when n > N(e). It follows 
that for all n > max{N(e), N(e)}, 

If fndfl- f ~fndfl\< f \f n -f n \dfl<2€fl(X). 
I J X J X I "/X 

This completes the sketch of the proof. 

Definition 4.4 The limit in Theorem 4.1 is called the integral of f (with respect 
to fi).It is denoted by 

lim / f n d{L= I fdfi. 
n ^°° Jx Jx 

Examples & Exercises 

4.7. Let (X, d) be a metric space. Let a e X. Define a Borel measure 8 a by 8 a (B) = 
1 if a e B and 8 a (B) = 0 if a & B, for all Borel sets B c X. This measure is referred 
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to as a "a delta function" and "a point mass at a." Let / : X -> R be continuous. 
Show that 

f f(x)d8 a (x) = f(a). 
Jx 

4.8. This example takes place in the metric space (■, Euclidean). Let /x be the 
measure defined in exercise 4.2, and define / : ■ -> R by /(jc, y) = x 2 + 2xy + 3. 
Evaluate 




4.9. Make an approximate evaluation of the integral f A x 2 dfi(x) where fi and A 
are as defined in exercise 4.3. 

4.10. Let X denote the set of pixels corresponding to a certain computer graphics 
display device. Define a metric donXso that (X, d) is a compact metric space. Give 
an example of a Borel subset of X and of a nontrivial Borel measure on X. Show that 
any function / : -X -> R is continuous. Give a specific example of such a function, 
and evaluate f x f d\x. 



5 The Compact Metric Space CP(X), J) 

We introduce the most exciting metric space in the book. It is the space where 
fractals really live. 

Definition 5. 1 Let (X, d) be a compact metric space. Let /x be a Borel measure 
on X. ///x(X) = 1, then /x is said to be normalized. 

Definition 5.2 Let (X, d) be a compact metric space. Let V(X) denote the set 
of normalized Borel measures on X. The Hutchinson metric d H on V(X) is defined 
by 

d H (l^, v) = sup jy fdfi- j f dv: f : X ^ Rf continuous, 
\f(x) - f(y)\<d(x,y)Vx,yeX 

for all fi, v e V(X). 

Theorem 5.1 Let (X, d) be a compact metric space. Let V(X) denote the set 
of normalized Borel measures on X and let d H denote the Hutchinson metric. Then 
CP(X), du) is a compact metric space. 

Sketch Of proof A direct proof, using the tools in this book, is cumbersome. 
It is straightforward to verify that d H is a metric. It is most efficient to use the 
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concept of the "weak topology" on V(X) to prove compactness. One shows that this 
topology is the same as the one induced by the Hutchinson metric, and then applies 
Alaoglu's theorem. See [Hutchinson 1981] and [Dunford 1966]. 

Examples & Exercises 

5.1 . Let K be a positive integer. Let X = {(/, j) : i, j = 1, 2, . . . , K). Define a 
metric on X by d((i\, (12, ji)) = l'i — ii\ + \j\ — Then (X, d) is a compact 
metric space. Let \x e V(X) be such that (i((i, j)) = (i + j)/(K 3 + K 2 ) and let v e 
V(X) be such that v(i, j) = l/K 2 , for all /, j e {1, 2, . . . , N}. Calculate d H (ii, v). 

5.2. Let (X, d) be a compact metric space. Let fi e V(X). Prove that the support of 
[x belongs to H(X). 



6 A Contraction Mapping on V(X) 

Let (X, d) denote a compact metric space. Let B denote the Borel subsets of X. Let 
w : X -> X be continuous. Then one can prove that w~ x : B -> B. It follows that if v 
is a normalized Borel measure on X then so is v o w~ x . In turn, this implies that the 
function defined next indeed takes V(X) into itself. 

Definition 6.1 Let (X, d) be a compact metric space and let V(X) denote the 
space of normalized Borel measures on X. Let 

{X; wu W2, ■ ■ ■ , w N ; p\, p 2 , . . . , Pn] 

be a hyperbolic IFS with probabilities. The Markov operator associated with the IFS 
is the function M : V(X) -> V(X) defined by 

M(v) = p\v o + P2V o H h Pnv o w^ 1 

for alive VOL). 

Lemma 6. 1 Let M denote the Markov operator associated with a hyperbolic 
IFS, as in Definition 6.1. Let f : X — > R be either a simple function or a continuous 
function . Let v e V(X). Then 

f fd(M(v)) = Y]Pi f fowidv. 
Jx ~[ Jx 

Proof Suppose that / : X — > R is continuous. By Theorem 5.1 we can find a 
sequence of simple functions {/„} which converges uniformly to /. Let n be a 
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positive integer. It is readily verified that 



f f n d(M(v)) = f 



fn dv O W] 



-1 



N , 



V O W: 



-1 




n 



o wi dv. 



The sequence {/ f n d(M(v))} converges to / fd{M (v)). 

For each i e {1, 2, . . . , N} and each positive integer n, /„ o w t is a simple func- 
tion. The sequence {/„ o iw,-}~ j converges uniformly to / o w t . It follows that 
{/ /„ o Widv)^L j converges to / / o iw,-. It follows that Pi / /« ° »Mv}~ , 

converges to Ya=\ Pif f° w i dv - This com P letes the P roof - 

Theorem 6. 1 Let (X, d) be a compact metric space. Let 



be a hyperbolic IFS with probabilities. Let s e (0, 1) be a contractivity factor for 
the IFS. Let M : V(X) V(X) be the associated Markov operator. Then M is a 
contraction mapping, with contractivity factor s, with respect to the Hutchinson 
metric on V(X). That is, 

d H (M(v), M{^)) < sd H (v, il) for all v, [i e V(X). 
In particular, there is a unique measure /x e V(X) such that 

Mfi = [i. 

Proof Let L denote the set of continuous functions / : X R such that I/O) - 
f(y)\ < d{x, y)Vx, yeX. Then 



•Let / = s- 1 £f=i pif o Wi . Then / e L. Let L = {/ € L : / = s~ l Pif ° ">» 
some f e L}. Then we can write 



{X; u>i, w 2 , . . . , w N ; p\, P2, • • • , Pn) 




(Af(v),M(/Lt)) = sup{j / fdfM-s 



fdv : / € L). 



Since L C L, it follows that 



1 d H (M(v), M{n)) < sd H (v, ii). 



This completes the proof. 
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Definition 6.2 Let fi denote the fixed point of the Markov operator, promised by 
Theorem 6.1. \x is called the invariant measure of the IFS with probabilities. 

We have arrived at our goal! This invariant measure is the object we discussed 
informally in section 1 of this chapter. Now we know what fractals are. 

Examples & Exercises 

6. 1 . Verify that the Markov operator associated with a hyperbolic IFS on a compact 
metric space indeed maps the space into itself. 

6.2. This example uses the notation in the proof of Theorem 6.1. Let / e L and let 
/ = ££Li Pif ° w i- Prove that f e L. 

6.3. Consider the hyperbolic IFS 

{■C M 2 ; w\, w 2 , w 3 , w 4 ; p u p 2 , p 3 , p*} 

corresponding to the collage in Figure IX.255(a). Let M be the associated Markov 
operator. Let ixq e V(X), so that fio(m) = 1. For example, hq could be the uni- 
form measure, for which ijlq(S) is the area of S e V(m). We look at the sequence 
of measures = M on (fi 0 )}. The measure ii\ = M(no) is such that fi(wi(m)) = /?, 
for i = 1, 2, 3, 4, as illustrated in Figure IX.255(b). It follows that \x 2 = M o2 O 0 ) 
obeys fi(wi o Wj(w)) = pipj for /, j = 1, 2, 3, 4, as illustrated in Figure IX.255(c). 
We quickly get the idea. When the Markov operator is applied, the "mass" in 
a cell * = W{ o wj o ■ ■ ■ o w k (m) is redistributed among the four smaller cells 
u>i(*ij...k),W2(*ij...k),u)3(mij mmm k), and iu 4 (■,; .*). Also, mass from other cells is 
mapped into subcells of mtj. m , k in such a way that the total mass of * remains 
the same as before the Markov operator was applied. In this manner the distribution 
of "mass" is defined on finer and finer scales as the Markov operator is repeatedly 
applied. What a wonderful idea. We have also illustrated this idea in Figures IX.256 
and IX.257. 

6.4. Apply the Random Iteration Algorithm to an IFS of the form considered in 
example 6.3. Choose the probabilities p\, p 2 , p 3 , and p 4 so as to obtain a "picture" 
of the invariant measure that would occur at the end of the sequence that commences 
in Figure IX.257(a), (b), (c), and (d). 

6.5. Consider the IFS 

{[0, 1] c R; wi(x) = (0.5)*, w 2 (x) = (0J)x + 0.3; p x = 0.45, p 2 = 0.55}. 

The attractor of the IFS is [0, 1]. Let M denote the associated Markov operator. 
Let fi 0 e V([0, 1]) be the uniform measure on [0, 1]. In Figure IX.258(a), /jl 0 is 
represented by a rectangle, whose base is [0, 1] and whose area is 1. The successive 
iterates M(^ 0 ), M o2 Oo), M o3 Oo) are represented in Figure IX.258(b), (c) and (d). 
Each measure is represented by a collection of rectangles whose bases are contained 
in the interval [0, 1]. The area of a rectangle equals the measure of the base of the 
rectangle. Although the sequence of measures converges {M 0 "^)} in the metric 
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(b) 
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P 4 


P 3 


P l 


P 2 



Figure IX.255. A col- 
lage for an IFS of four 
maps. The attractor of the 
IFS is ■, and the prob- 
ability of the map tu, 
is pi for i = 1,2,3,4. 
Let M denote the as- 
sociated Markov oper- 
ator. Let no = 1 • Then 
ix i = M(no) is a measure 
such that iA(u)i(M)) = /?, 
for i = 1, 2, 3, 4, as illus- 
trated in (b). The measure 
fx 2 = M° 2 (ix 0 ) is such that 
/x(WiO Wj(M)) = pipj for 
i, j = 1,2,3, 4, as illus- 
trated in (c). See also Fig- 
ures IX.256 and IX.257. 



Figure IX.255. (b) 
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Figure IX.255. (c) (c) 
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space^QO, 1], dn), some of the rectangles would become infinitely tall as n tends 
to infinity. 

6.6. Make a sequence of figures, analagous to Figure IX.258(a)-(d), to represent 
the Markov operator applied to the uniform measure /x 0 , for each of the following 
IFS's with probabilities: 

(0{[0, 1] C R; iui(jc) = (0.5)*, w 2 (x) = (0.5)* + 0.5; p x = 0.5, p 2 = 0.5}; 
0'/){[0, 1] C R; iui(jc) = (0.5)x, w 2 (x) = (0.5)x + 0.5; 

pi =0.99, p 2 = 0.01}; 
(iii){[0, 1} c R; wi(x) = (0.9)x, w 2 (x) = (0.9)x + 0.1; p x = 0.45, p 2 = 0.55}. 
In each case describe the associated invariant measure. 

6.7. Let X = {A, B, C} denote a space that consists of three points. Let B denote 
the a -field that consists of all subsets of X. Consider the IFS with probabilities 

{X; w\, w 2 ; pi = 0.6, p 2 = 0.4}, 

where u>i : X — > X is defined by w\{A) = B, wi(B) = B, w\(C) = B, and w 2 : 
X X is defined by w 2 (A) = C, w 2 (B) = A, and w 2 (C) = C. Let V(X) denote the 
set of normalized measures on B. Let i±q e V(X) be defined by /x 0 (A) = ^q(B) = 
fi 0 (C) = |. Let M denote the Markov operator associated with the IFS, and let 
jji n = M on (fio) for n = 1, 2, 3, — Determine real numbers a, b, c, d, e, f, g, h, 
i such that for each n, 
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MASS IN CELL IS 
DISTRIBUTED TO 
SMALLER CELLS 





MASS FROM 
ELSEWHERE 
MAKES UP 
THE LOSS 




Figure IX.256. This il- 
lustrates the action of the 
Markov operator on one 
of the sequence of mea- 
sures {M°"(/x 0 )}, where 
fi 0 (W) = 1. When the 
Markov operator is ap- 
plied, the "mass" in a cell 

■(';'...* = U)i 0 U)j O ■ ■ ■ O 

w k (M) is redistributed 
among the four cells 
w i(b, 7 ...,0, w 2 (m,j k ), 
u; 3 (■,; .*), and tu 4 (B, 7 ..,0. 

Also, mass from other 
cells is mapped into sub- 
cells of Wjj k in such a 
way that the total mass of 
■,-_/...* remains the same as 
before the Markov oper- 
ator was applied. In this 
manner the distribution 
of "mass" is defined on 
finer and finer scales as 
the Markov operator is 
repeatedly applied. 



4 
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Figure IX.257. This 
sequence of figures repre- 
sents successive measures 
produced by iterative ap- 
plications of a Markov 
operator of the type con- 
sidered in Figures IX. 255 
and IX.256. The result 
of one application of the 
operator to the uniform 
measure on ■ is repre- 
sented in (a). Figures (b), 
(c), and (d) show the re- 
sults of further successive 
applications of the Markov 
operator. The measures are 
represented in such a way 
as to keep the total number 
of dots constant. The mea- 
sure of a set corresponds 
approximately to the num- 
ber of dots it contains. 
This represents the first 
few of a sequence of mea- 
sures that converges in the 
metric space (V(U),d H ) 
to the invariant measure of 
the IFS. 
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Figure IX.257. (c) 



Figure IX.257. (d) 
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Figure IX.258. This 
sequence of images re- 
lates to the IFS {[0, 1] C 
R; w x (x) = (0.5)x,w 2 (x) 
= (0.7)x + 0.3, p\ = 
0.45, pi = 0.55}. The at- 
tractorofthelFS is [0, 1]. 
Let M denote the associ- 
ated Markov operator. Let 
/x 0 e V({0, 1]) be the uni- 
form measure on [0, 1]. 
The successive iterates 
M(/x 0 ), M o2 (/x 0 ), (a) 
Af o3 (/x 0 ), and M o4 (/x 0 ) 
are represented in parts 
(a),(b),(c), and (d). Each 
measure is represented by 
a collection of rectangles 
whose bases are contained 
in the interval [0, 1]. The 
area of a rectangle equals 
the measure of the base of 
the rectangle. 



(b) 



(c) 
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Figure IX.258. (d) 



Let M denote the 3 x 3 matrix here. Explain how M is related to M, and show that 
the invariant measure of the IFS can be described in terms of an eigenvector of M. 

6.8. Let 

{X; wi, w 2 , w N ; p u p 2 , . . . , p N } 

be a hyperbolic IFS with probabilities. Let /x denote the associated invariant mea- 
sure. Let A denote the attractor of the IFS. Let /x 0 e V(X) be such that /x 0 (A) = 1. 
By considering the sequence of measures {/x„ = M 0,l (/xo)}, prove that 

fi(wi o wj o ■ ■ ■ o w k (A)) > pipj ... p k , for all /, j, . . . , k e {1, 2, . . . , N). 

Show that if the IFS is totally disconnected then the equality sign holds. 

Theorem 6.2 Let (X, d) be a compact metric space. Let 

{X; w\, w 2 , w N ; p u p 2 , . . . , p N } 

be a hyperbolic IFS with probabilities. Let /x be the associated invariant measure. 
Then the support of \x is the attractor of the IFS {X; wi, w 2 , . . . , w N }. 

Proof Let B denote the support of /x. Then B is a nonempty compact subset of 
X. Let A denote the attractor of the IFS. Then 

{A; w u w 2 , w N \ p u p 2 , p N ) 

is a hyperbolic IFS. Let v denote the invariant measure of the latter. Then v is also an 
invariant measure for the original IFS. So, since /x is unique, v = \x. It follows that 
Be A. 

Let a e A. Let O be an open set that contains a. We will use the notation of 
Theorem 2.1 in Chapter IV. Let E denote the code space associated with the IFS 
and let a e E denote the address of a. It follows from Theorem 2.1 in Chapter IV 
that lim^oo 0(cr, n, A) = a, where the convergence is in the Hausdorff metric. It 
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follows that there is a positive integer n so that 0(or, n, A) C O. But 

/X(0((T, «, A)) > p ax Pa 2 ■ ■ ■ Po„ > 0. 

It follows that /z(0) > 0. It follows that a is in the support of /x. It follows that 
a e B. It follows that A c J5. This completes the proof. 

Theorem 6.3 The Collage Theorem for Measures.. Let 

{X; w\, w 2 , . . . , w N \ p\, P2, ■ ■ ■ , Pn} 

be a hyperbolic IFS with probabilities. Let /x be the associated invariant measure. 
Let s e (0, 1) be a contractivity factor for the IFS. Let M : V(X) -> V(X) be the 
associated Markov operator. Let v e V(X). Then 

d H (v, M(v)) 



d H (v, fi) < 



Proof This is a corollary of Theorem 6.1. 

We conclude this section with a description of the application of Theorem 6.3 to 
an inverse problem. The problem is to find an IFS with probabilities whose invariant 
measure, when represented by a set of dots, looks like a given texture. 

A measure supported on a subset of R 2 such as ■ can be represented by a lot of 
black dots on a piece of white paper. Figures IX.248 and IX.250 provide examples. 
The dots may be granules of carbon attached to the paper by means of a laser printer. 
The number of dots inside any circle of radius \ inch, say, should be approximately 
proportional to the measure of the corresponding ball in 1R 2 . A gray-tone image in a 
newspaper is made of small dots and can be thought of as representing a measure. 

Let two such images, each consisting of the same number of points, be given. 
Then we expect that the degree to which they look alike corresponds to the Hutchin- 
son distance between the corresponding measures. Let such an image, L, be given. 
We imagine that it corresponds to a measure v. Theorem 6.3 can be used to help to 
find a hyperbolic IFS with probabilities whose invariant measure, represented with 
dots, approximates the given image. Let iVbea positive integer. Let w t : R 2 R 2 
be an affine transformation, for i = 1, 2, . . . , N. Let 

{1R 2 ; w\, w 2 , w N \ pi, P2, Pn) 

denote the sought-after IFS. Let M denote the associated Markov operator. 

Let Pi&L mean the set of dots L after the "density of dots" has been decreased 
by a factor . For example 0.5&L means L after "every second dot" in L has been 
removed. The action of the Markov operator on v is represented by u" =l Wi(pi&L). 
This set consists of approximately the same number of dots as L. Then we seek 
contractive affine transformations and probabilities such that 

U^Wtipi&L)^ L. (1) 
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That is, the coefficients that define the affine transformations and the probabilities 
must be adjusted so that the left-hand side "looks like" the original image. 

Suppose we have found an IFS with probabilities so that equation 1 is true. Then 
generate an image L of the invariant measure of the IFS, containing the same number 
of points as L. We expect that 

L^L. (2) 

If the maps are sufficiently contractive, then the meaning of should be the same 
in both equations 1 and 2. These ideas are illustrated in Figure IX.259. 

Examples & Exercises 

6.9. Use the Collage Theorem for Measures to help find an IFS with probabilities 
for each of the images in Figures IX.260, IX.261, and IX.262. 
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Figure IX.260. Can 

you find the IFS and prob- 
abilities corresponding to 
this texture? 
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Figure IX.262. Find 
the four affine maps and 
probabilities for this tex- 
ture. 




6.10. Estimate the probabilities and transformations used to make each part of 
Figure IX.248. 

6.11. Let 

{X; wi, u; 2 , ... , w N ; p u p 2 , . . . , p N } 

be a hyperbolic IFS. Let /x denote the invariant measure. Let A denote the attractor. 
Let E denote the associated code space on the N symbols {1,2,..., N}. Let 7} : 
£ E be defined by 7}(er) = ia , for all a e E, for / = 1, 2, 3, 4. Let p denote the 
invariant measure for the hyperbolic IFS 

{E; Ti, T 2 , T 3 , T 4 ; p u p 2 , P3, pA- 

Let 0 : E — ► A denote the continuous map between code space and the attractor of 
the IFS intoduced in Theorem 4.2.1. Prove that p((p- l (B)) = /x(B) for all Borel 
subsets B of X. 

6.12. Figure IX. 263 depicts the invariant measure for the IFS {[0, 1] C K; wi(x) = 
a\x, w 2 (x) = a 2 x + Pi, 

P2}, where a\, a 2 , and e 2 are real constants such that the attractor is contained in 
[0, 1]. The measure of a Borel subset of [0, 1] is approximately the amount of black 
that lies "vertically" above it. Find a\, a 2 , and e 2 . 
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Figure IX.263. This 
figure depicts the invari- 
ant measure for the IFS 
{[0, l]cR;tui(*) = 
a\X, w 2 — a 2 x + e 2 \ 
P\, Pi), where a\, a 2 , and 
e 2 are real constants such 
that the attractor is con- 
tained in [0,1]. The mea- 
sure of a Borel subset of 
[0, 1] is approximately the 
amount of black that lies 
"vertically" above it. Can 
you find a\, a 2 , and e 2 l 
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Both the following theorem and its corollary claim that certain events occur "with 
probability one." Although this has a very precise technical meaning, it is fine to in- 
terpret it in the same way as you would interpret the statement "There is a 100% 
chance of rain tomorrow." After the statements we mention the mathematical frame- 
work used for dealing with probabilistic statements. To go further we recommend 
reading parts of [Eisen 1969]. 

The theorem below is actually true when the p«'s are functions of x, the w«'s are 
only contraction mappings "on the average," and the space is "locally" compact. 

Theorem 7.1 Let (X, d) be a compact metric space. Let 

{X; w\, w 2 , • • • , w N ; p\, pi, . . . , Pn) 

be a hyperbolic IFS with probabilities. Let (X, d) be a compact metric space. Let 
MZLo denote an orbit of the IFS produced by the Random Iteration Algorithm, 
starting at xq. That is, 

X n = W 0n O Won-x O ••• O W ai (Xo), 

where the maps are chosen independently according to the probabilities 

for n = 1,2,3, .... 



Pu P2, 



Pn, 
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Let fi be the unique invariant measure for the IFS. Then with probability one (that 
is, for all code sequences o\, a 2 , . . . except for a set of sequences having probability 
zero), 



f(x)d/j,(x) 

k=0 ^ x 

for all continuous functions f : X -» M. and all x 0 . 
Proof See [Elton 1987]. 

Corollary 7. 1 Let B be a Borel subset ofX and let fi(boundary of B) = 0. Let 

M{B, n) = number of points in {x 0 , x { , x 2 , x 3 , . . . , x n } n B, for n = 0, 1,2, 

Then, with probability one, 

M(B)=lim (^>) 

„^oo [(„ + !)) 

for all starting points x 0 . That is, the "mass" of B is the proportion of iteration steps, 
when running the Random Iteration Algorithm, which produce points in B. 

Let's explain more deeply the context of the statement "with probability one." Let 
E denote the code space on the TV symbols {1,2, . . . , N}. Let p denote the unique 
Borel measure on E such that 

p(C(a u a 2 , a m )) = p a{ p ai > • • Po m 

for each positive integer m and all o\ , a 2 , . . . ,o m e {1,2, . . . , N}, where 

C{o\, o 2 , o m ) = {a> e E : a>\ — o x , co 2 = a 2 , . . . , co m = a m \. 

Then This measure provides a means for assigning probabilities to sets 

of possible outcomes of applying the Random Iteration Algorithm. Let us see how 
this works. 

When the Random Iteration Algorithm is applied, an infinite sequence of symbols 
o)\, o) 2 , a>3, . . ., namely a code co = a>ico 2 a>2 . . . e £, is generated. Provided that we 
keep xo e X fixed, we can describe the probabilities of orbits {x n } in terms of the 
probabilities of codes co. So we examine how probabilities are associated to sets of 
codes. 

The Random Iteration Algorithm is applied and produces a code co e E. What is 
the probability that co x = 1? Clearly it is p x = p(C(l)). What is the probability that 
co\ = o\, co 2 = o 2 , . . ., and co n =c n l Because the symbols are chosen independently, 
it is 

p(C(a u a 2 , tT m )) = p ax p ai ... p<j m . 

Let B denote a Borel' subset of E. What is the probability that the Random Itera- 
tion Algorithm produces a code a e Bl It is at least intuitively reasonable that it is 
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p(B). This can be formalized, see, for example [Eisen 1969]. The measure p pro- 
vides a means of describing the probabilities of outcomes of the Random Iteration 
Algorithm. 

Here is a heavy way of stating the central part of Theorem 7.1. "... Let x 0 e X. 
Let B c S denote the set of codes a e E such that 



1 " f 
lim ~y^f( Xk )= / f(x)dfi(x), 

n^oo n + 1 Jx 



for all jc 0 eX and all continuous functions / : X -> R, where 

x n = ™<j n o w an _ x o . . . o w ai (xo). 

Then B is a Borel subset of E and p(B) = 1." A similar heavy restatement of the 
corollary can be made. 

Examples & Exercises 

7.1. This example concerns the IFS 

{[0, 1]; -x, -x + 0.5, 0.5}. 
2 2 2 

Show that the invariant measure fi is such that /z([x , x + 8]) = A when [x , x + 8] is 
a subinterval of [0,1]. Deduce that if / : [0, 1] -> R is a continuous function then 



f f(x)dx= f fdfi. 

J0 J [0,1] 



Let f{x) = 1 + x 2 . Compute approximations to the latter integral with the aid of 
Elton's theorem and the Random Iteration Algorithm. Compare your results with the 
exact value | . 

7.2. This example concerns the IFS 

{■ c R 2 ; wi, w 2 , w, wa\ 0.25, 0.25, 0.25, 0.25} 

corresponding to the collage in Figure IX.255(a). Let fi denote the invariant mea- 
sure. Argue that /x is the uniform measure that assigns "measure" dxdy to an in- 
finitesimal rectangular cell of side lengths dx and dy. Use Elton's theorem and the 
Random Iteration Algorithm to evaluate approximations to 



L 



(x 2 + 2xy + 3y 2 ) dxdy. 

Compare your approximations with the exact value. 

7.3. This example concerns the IFS 

, 111 

{A c 0T; w lt w 2 , w 3 ; -, -, -}, 

where 

11 111 111 

wi(x,y) = (-x,-y), w 2 (x, y) = (-x, -y + -), w 3 (x, y) = (-x + -, -y), 
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and A is the attractor of the IFS, our old friend. Let [i denote the invariant measure 
of the IFS. Argue that [i provides a good concept of a "uniform" measure on A. Use 
Elton's theorem and the Random Iteration Algorithm to compute approximations to 

/ (jc 2 + 2jry + 3y 2 )dxdy. 
J A 

In Chapters II, III, and IV, we introduced the space of shifts on N symbols. 
It was mentioned in passing in Chapter IV that any invertible mixing function could 
be represented by a baker's transformation with "uneven cutting and stretching." We 
are now in a position to show how this comes about using an example involving two 
simple IFS. The same model with some necessary refinements yields the code space 
mixing model used to justify the representations. It is easier to visualize without the 
refinements, as we present it here. It is one of the most important properties involved 
with the modelling of physical chaos. 

We begin with perhaps the most simple of all IFS with probabilities. On the 
interval [0, 1], we define the just-touching IFS with N maps and with probabilities 

{[0, 1]; w u w 2 ,..., w N ; p u p 2 , ■ • • , Pn), 

where 

1 



wi(x) = 


— JC 

N 






w 2 (x) = 


1 

— JC 

N 


+ 


1 

N 


w 3 (x) = 


1 

— JC 

N 


+ 


3 
N 


w N (x) = 


1 

— JC 

N 


+ 


N 



N 



and the probabilities are arbitrary, subject to the usual condition 

N 

Associated with this IFS there is an invariant measure on [0, 1], which we denote 
by v. 

Now we define another IFS on [0, 1], this time without probabilities, using the p, 

from above. On [0, 1] define the IFS 

i 

{[0, 1]; v u v 2 ,..., v N }, 



where 
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V\(x) = p\x 
V2(X) = P2X + P\ 

v 3 (x) = p 3 x + (P\ + Pi) 

k-\ 

v k (x) = PkX + ^ Pi 
i = \ 



N-l 



v N (x) = PnX + ^2 Pi- 

i = \ 

This IFS is also just-touching by construction and, because the probabilities from the 
first IFS sum to one, has as its attractor the interval [0, 1] as well. We are going to 
use it to define an equivalent metric on [0, 1] as follows: 

Each point has a unique address under this IFS in code space, except the points 
Vi(A), whose multiple addresses correspond to 

(T = i7T^T=(i + 1)0. 

These are precisely the points in a base A/" expansion of a real number which 
are equated to form the real line. We denote the value of a point x with address 
jc 1*2*3 ... in this new metric space to be the real number with N-ary expansion 
.* 1*2*3 .... In effect we have given each point the numeric value that would corre- 
spond to having measured its distance from say 0 with a ruler on which the spacing 
of the tick marks had been made uneven in a very specific way by the IFS. 

With these values, the space is still [0,1], but we put a metric on it by assigning 
the distance between two real numbers to be the distance measured with a "normal" 
ruler. Another way to put it is that we take the normal interval [0, 1] and assign the 
distance between two points to be the distance between the addresses corresponding 
to their N-ary expansions in the above IFS. Thus if N = 10 for instance, the distance 
between .251 and .137 is not .251 - .137, but rather the distance between the points 
with addresses 2510 and 1370 in the IFS {[0, 1]; v u v 2 , ■ • • , uiO}. We will call this 
space [0, \] p , and the distance function d p to avoid confusion. 

We have a metric space, so we will now assign a Borel measure to it by defining 
H([a, b]) = /a ((a, b)) = d p (a, b), which is uniform for this metric space. And to 
proceed with the example, we need a function, / : [0, 1] -+ [0, \] p which we define 
by f(x) =(point with value x in [0, Because the definition was very careful to 
preserve the ordering of the real line and its conventions about multiple addressing, 
/ is both a homeomorphism and a metric equivalence. Because it is continuous, it 
is also what is called a measurable function in that if A € B([0, \] p ) then / _1 (A) e 
£([0,1]). 

7.4. Show that / is measure -pre serving with respect to the invariant measure on v 
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associated with 

{[0, 1]; w u w 2 , w N ; p u p 2 , . . . , p N }\ 
that is, that for any Borel subset A e [0, 1], we have v(A) = 

We now have the machinery to cast the Random Iteration Algorithm entirely in 
terms of IFS with no recourse to randomness. It is really a deterministic model, with 
the random part coming in to help when a very simple statement made all the time 
in mathematics turns out to be something a computer cannot do. 

The exact transfer of the Random Iteration Algorithm to the model using the 
space ([0, \] p , d p ) looks like this: define the function g : [0, 1] -» [0, l] p by 

d p (g(x),0) = x. 

Define the map h : [0, l] p — > {0, 1, 2, . . . , N — 1} defined by h(p) = [Np] where 
[■] is the greatest integer function. In other words, take the first /V-ary digit of the 
value of the point p e [0, \] p . Define the map y : [0, 1] -> [0, 1] given by y(x) = 
Nx mod N. Then the Random Iteration Algorithm is precisely the iteration of the 
map R : [0, 1] x X given by 

R(p, x) = (y(p), w hog(p) (x)). 

Where does the random part of the algorithm come in? We need it to "pick a real 
number." One can think of the random number at each iteration as a function to get 
the next digit of the real number we "picked." In the above expression, we get a 
random number and find out which function to use via h(g(p)). Then we iterate the 
IFS using Whog{ p ){x), and in order to have a new "random number" the next time, we 
advance p to the next digit using y(p). 

Now, consider the space [0, \] p x [0, 1]. Think of it as a square with coordinates 
spaced unevenly in the x direction and evenly in the y direction. Your "usual" point 
in the square (where here usual means with probability one) has an /V-ary expansion 
for y in which every digit occurs with equal probability, while the x value has an N- 
ary expansion in which 0 occurs with probability p\, 1 occurs with probability p 2 , 
etc. 

7.5. Draw a diagonal from (0, 0) to (1, 1) on this square. Show that this statement 
is still true if we pick a "usual point" from this diagonal. 

7.6. Draw a smooth curve from (0, 0) to (1, 1) on the square. Then the statement is 
still true if we pick a "usual point" from this curve. 

By using the diagonal in exercise 7.5, we can take a point x in [0, 1] and map it to 
anew point x, by putting x along the vertical coordinate and reading the horizontal 
coordinate like a web diagram. In terms of all the functions we have defined, this 
operation is x = f~ l (g(x)). Under the original IFS with probabilities, this new point 
will, with probability 1, have an orbit under the shift dynamical system {A; S} with a 
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distribution of dots identical to the one we wou!d get by using* the Random Iteration 

the uneven coordinates correspond .» 
"Idfor r^^^. allowed to be infinite.) It is a mixing function, so 

here accurately reflects the behavior of the IFS on addresses. 



8 Application to Computer Graphics 

probabilities can be produced. The idea is very simpic 

{€■ 0.5z + 24 + 24/, 0.5z + 24i, 0.5z; 0.25, 0.25, 0.5}. 
A viewing window and a corresponding array of pixels P, is specified The Random 

noints JV( /> •) which lie in the pixel P„ are counted. The pixel P„ is assignee ^ 
Te C ) ™m,« By Elton's theorem, if numi.s is large, this value should be a 

in colors determined from their measures. 

The following program implements this procedure. It .s is written in BASIC 
It ™s wihout modification on an IBM PC with Enhanced Graph.cs Adaptor and 
Turbobasic. 

Program 1. (Uses the Random Iteration Algorithm to Make a "Picture" of the Invariant 
Measure Associated with an IFS with Probabilities) 
screen 9 : els 'Initialize graphics. 

dim 8(51.51) 'Allocate array of pixels. 

a(2)=0.B : b(2)-0 : c(2)-0 : d(2)-0.B : .(2)-0 : «2>-24 
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a(3)=0.5 : b(3)=0 : c(3)=0 : d(3)=0.5 : e(3)=0 : f(3)=0 

'Probabilities for the IFS; they must add to one! 
p(l)=0.25 : p(2)=.25 : p(3)=.5 

mag=l 'Magnification factor. 

'Increase the number of iterations as you magnify. 
numits=5000 

factor =100 'Scales pixel values to color values. 

'This is the number of colors you are able to use. 
numcols=8 

for n=l to numits 'Random iteration begins! 

r=rnd : k=l 'Pick a number in [0,1] at random. 

if r > p(l) then k=2 

if r > p(l)+p(2) then k=3 

'Map k is picked with probability p(k) . 
newx=a[k]*x + b[k]*y + e[k] 
newy=c[k]*x + d[k]*y + f [k] 

x=newx : y=newy 

i=int(mag*x) : j=int(mag*y) 'Scale by magnification factor 

if (((i < 50) and (i>=0))and((0=<j) and (j<50))) then 
' If the scaled value is- 

s(i, j)=s(i, j)+l 

'...in the array add one to pixel (i,j). 
end if 

pset(i.j) 'Plot the point. 

if instat then end 'Stop if a key is pressed, 
next 
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for i=0 to 49 'Normalize values in pixel array, and plot... 

for j=0 to 49 '...in colors corresponding to the normalized... 

'...values of the numbers s(i,j). 

col=s (i , j ) *numcols*f actor*mag*mag/numits 

'Plot the pixel (i, j) in the color determined by. . . 
pset (i, j) ,col 

next j '...its measure. 

next i 

end 

The program allows the user to zoom in on a piece of the rendered measure 
by altering the value of the magnification parameter mag. The result of running 
an adaptation of this program on a Masscomp workstation and then printing the 
contents of the graphics screen is shown in Figure IX.264. 

Rendered invariant measures for IFS's acting in R 2 are also shown in Fig- 
ure IX.265. 

By carrying out some simple computergraphical experiments, using a program 
such as the one above, we discover that "pictures" of invariant measures of IFS's 
possess a number of properties, (i) Once the viewing window and color assignments 
have been fixed, the image produced is stable with respect to the number of itera- 
tions, provided that the number of iterations is sufficiently large, (ii) Images vary 
consistently with respect to translation and rotation of the viewing window, and 
with respect to changes in resolution. In particular they vary consistently when they 
are magnified, (iii) The images depend continuously on the IFS code, including the 
probabilities. Property (i) ensures that the images are well defined. The properties in 
(ii) are also true for views of the real world seen through the viewfinder of a cam- 
era. Property (iii) means that images can be controlled interactively. These properties 
suggest that IFS theory is applicable to computer graphics. 

We should, if we have done our measure theory homework, understand the rea- 
sons for (i) and (ii). They are consequences of corresponding properties of Borel of 
measures on R 2 . Property (iii) follows from a theorem by Withers [Withers 1987]. 

Examples & Exercises 

8. 1 . Rewrite Program 1, section 8, in a form suitable for your own computer envi- 
ronment. Adjust numits and factor to ensure that a stable image results. Then make 
experiments to verify that the conditions (i)-(iii) above are verified. For example, to 
test the consistency of images with respect to changes in resolution you should try 
mag = 0.5, 1, and 1.5. Unless you have a very powerful system, do not make ex- 
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Figure IX.264. The 

result of running a mod- 
ified version of Program 
9.8.1 and then printing 
the contents of the graph- 
ics screen in gray tones. 
A rendered picture of a 
measure is the result. 



treme adjustments. For example, do not choose mag too small, otherwise you will 
need a very large value for numits. 

. Applications of fractal geometry to computer graphics have been investigated 
by a number of authors including Mandelbrot [Mandelbrot 1982], Kawaguchi 
[Kawaguchi 1982], Oppenheimer [Oppenheimer 1986], Fournier et al. [Fournier 
1982], Smith [Smith 1984], Miller [Miller 1986], and Amburn etal. [Amburn 1986]. 
In all cases the focus has been on the modelling of natural objects and scenes. Both 
deterministic and random geometries have been used. The application of IFS theory 
to computer graphics was first reviewed in [Demk 85]. It provides a single frame- 
work that can reach an unlimited range of images. It is distinguished from other 
fractal approaches because it is the only one that uses measure theory. 

The -modelling of natural scenes is an important area of computer graphics. Pho- 
tographs of natural scenes contain redundant information in the form of subtle pat- 
terns and variations. There are two characteristic features: (i) the presence of com- 
plex geometrical structure and distributions of color and brightness at many scales; 
and (ii) the hierarchical layout of objects, (i) Natural boundaries and textures are not 
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Figure IX.265. Ren- 
dered invariant measures 
for IFS's of two maps. 



(a) 




smoothed out under magnification; they preserve some degree of geometrical com- 
plexity, (ii) Natural scenes are organized in hierarchical structures. For example, a 
forest is made of trees; a tree is a collection of boughs and limbs along a trunk; on 
each branch there are clusters of leaves; and a single leaf is filled with veins and 
covered with fine hairs. It appears often in a natural scene that a recognizable en- 
tity is built up from numerous near repetitions of some smaller structure. These two 
observations can be integrated into systems for modelling images using IFS theory. 



Examples & Exercises 

8.2. Examine a good-quality color photograph of a natural scene, such as can be 
found in a Sierra Club calender, or an issue of National Geographic. Discuss the 
extent to which (i) and (ii) are true for that photograph. Be specific. 
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Figure IX.265. (b) 



In [Barnsley 1988a] it is reported that IFS theory can be used efficiently to model 
photographs of clouds, mountains, ferns, a field of sunflowers, a forest, seascapes 
and landscapes, a hat, the face of a girl, and a glaring arctic wolf. 

There are two parts to making any computer graphics image: geometrical mod- 
elling and rendering. Consider an architect making a computergraphical house: first 
she defines the dimensions of the floor, the roof, the windows, the shapes of the 
gables, and so on, to produce the geometrical model. Traditionally this is specified 
in terms of polygons, circles, and other classical geometrical objects that can be con- 
veniently input to the computer. This model is not a picture. To make a picture, the 
model must be projected into two dimensions from a certain point of view and dis- 
tance, discretized so that it can be represented with pixels, and finally rendered in 
colors*on a display device. 

Here we describe briefly the software system designed by the author, Alan Sloan, 
and Laurie Reuter, which was used to produce the color images that accompany 
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this section. More details can be found in [Reuter 1987] and [Barnsley 1988a]. The 
system consists of two subsystems known as Collage and Seurat. Collage is used for 
geometrical modelling, while Seurat is used for rendering. 
Collage and Seurat process IFS structures of the form 



where the maps are affine transformations in 1R 2 . An IFS is represented by a file that 
consists of an IFS code, where each coefficient is written with a fixed number of bits 
Let /x denote the invariant measure of such an IFS and let A denote the attractor. 
The pair (A, /x) is referred to as an underlying model. The attractor A carries the 
geometry while /x carries the rendering information. One can think of the IFS code, 
or equivalently (A, /x), as being analagous to the plans of an architect. It corresponds 
to many different pictures. 

Collage is a geometrical modelling system used to determine the coefficients 
of the affine transformations w\, W2, . . , w^. It is based on the Collage Theorem. 
Seurat is a software system for rendering images starting from an IFS code. An 
image is produced once a viewing window, color table, and resolution have been 
specified. This is achieved using the Random Iteration Algorithm. Its mathematical 
basis is Elton's Theorem. Seurat is also used in an interactive mode to determine the 
probabilities and color values. 

The input to Collage is a target image, which we denote here by T. For example, 
T may be a polygonal approximation to a leaf. We suppose that 



and that the screen of the computer display device corresponds to ■. T is rendered 
on the graphics workstation monitor. An affine transformation 



is introduced, with coefficients initialized at a\ — d\ = 0.25, b\ = c\ = e\ = f\. The 
image w\{T) is displayed on the monitor in a different color from T. w\{T) is a 
quarter-sized copy of T , centered closer to the point (0, 0). The user now interac- 
tively adjusts the coefficients with a mouse or some other interaction technique, so 
that the image w\{T) is variously translated, rotated, and sheared on the screen. The 
goal of the user is to transform w\ (T) so that it lies over part of T. It is important that 
the dimensions of w\(T) are smaller than those of T, to ensure that w\ is a contrac- 
tion. Once w\{T) is suitably positioned, it is fixed, and a new subcopy of the target, 
W2(T), is introduced. u>2 is adjusted until u>2(T) covers a subset of those pixels in 
T that are not in w\{T). Overlap between w\{T) and W2(T) is allowed, but in gen- 
eral it should be made as small as possible, for efficiency. New maps are added and 
adjusted until U^ =l Wj(T) is a good approximation to T. The output from Collage is 



[R 2 ; wi, w 2 , w N ; p u p 2 , Pn ■ n = 1,2, N), 



T C ■ = {(*, y) € R 2 : 0 < x < 1, 0 < y < 1}, 
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the resulting IFS code. The probability pj is chosen proportional to \ajdj - bjCj\ if 
this number is nonzero, and equal to a small positive number if the determinant of 
Aj equals zero. 

The input to Seurat is one or more IFS codes generated by Collage. The view- 
ing window and the number of iterations are specified by the user. The measures of 
the pixels are computed. The resulting numbers are multiplied by the inverse of the 
maximum value so that all of them lie in [0,1]. Colors are assigned to numbers in 
[0, 1] using a color assignment function. The default is a gray scale where the inten- 
sity is proportional to the number, such as 0 corresponds to black and 1 corresponds 
to brightest white. The coloring and texture of the image can be controlled through 
the probabilities and the color assignment function. Although one does not explic- 
itly use it, Theorem 9.6.3 lies in the background and can help in the adjustment of 
the probabilities. 

Color Plate 21 shows some smoking chimneys in a landscape. We obtained the 
IFS codes for the elements of this image we obtained using Collage. Different color 
assignment functions are associated to different elements in the image. The image 
was rendered using Seurat. 

The consistency of images with respect to changes in resolution is illustrated in 
Color Plate 22, which shows a zoom on one of the smokestacks in Color Plate 21. 
The number of iterations must be increased with magnification to keep the number 
of points landing within the viewing window constant. This requirement ensures the 
consistency of the textures in an image throughout the magnification process. 

Color Plates 23 and 24 show various renderings of leaves produced by Seurat. 

Color Plate 25 shows a sequence of frames taken from an IFS encoded movie en- 
titled A Cloud Study [Barnsley 1987]. The smooth transition from frame to frame is 
a consequence of the continuous dependence on parameters of the invariant measure 
of the IFS for the cloud. 

Color Plates 26, 27, and 28 were encoded from color photographs. Segmenta- 
tion according to color was performed on the originals to define textured pieces. IFS 
codes for these components were obtained using Collage. The IFS data base con- 
tained less than 180 maps for the Monterey seascape, and less than 160 maps for the 
Andes Indian girl. 

The two primitives, a leaf and a flower, in Color Plate 29 were used as condensa- 
tion sets in the picture Sunflower Field, Color Plate 30. Here we see the hierarchical 
structure: the leaf is itself the attractor of an IFS; and the flower is an overlay of 
four IFS attractors. The leaf is a condensation set for the IFS that generates all of 
the leaves. The flower is a condensation set to an IFS that generates many flowers, 
converging to the horizon. In the pictures Sunflower Field and Black Forest, shown 
in Color Plates 31-34, the primitives were displayed from back to front. The data 
bases for the Sunflower Field and Black Forest contain less than 100 and 120 maps, 
respectively. Notice the 1 shadows behind the little trees in the background in Color 
Plate 32. The winter forest pictures were obtained by adjusting the color assignment 
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function. The important point is that once the adjustment has been made, the image 
and the zoom are consistent. 

Examples & Exercises 

8.3. Use the Collage Theorem to help you find an IFS code for a leaf. Adjust 
your version of Program 1 in section 8 to allow you to render images of associated 
invariant measures. Assign colors in the range from red through orange to green. 
Adjust the probabilities. Obtain a spectacular color picture of the leaf showing the 
veins. Make a color slide of the output. To photograph a picture on the screen of a 
computergraphics monitor, use a telephoto lens. Mount the camera on a tripod, and 
take the photograph in a darkened room, on Ectachrome 64 ASA color slide film, 0. 1 
sec exposure, f-stop 5.6. For possible publication, submit the color slide, together 
with a letter of copyright assignment, to Michael Barnsley, Iterated Systems, Inc., 
5550-A Peachtree Parkway, Suite 650, Norcross GA 30092 USA. Include a self- 
addressed envelope. 

8.4. Obtain a very powerful computer with good graphics. Find the heirarchical 
IFS codes for the Sunflower Field. Replace the sunflowers by roses. Fly into your 
picture, to explore forever that scent-filled horizon. You are on your own. 



Chapter X 

Recurrent Iterated Function 
Systems 



1 Fractal Systems 

The goal of this chapter is to describe some general systems that can be used to con- 
struct deterministic fractals. We are concerned with the invention of mathematical 
machines for designing and controlling fractals. 

We use the name "recurrent iterated function systems" loosely to encompass the 
systems introduced in this chapter. Examples include "IFS's" in which the applica- 
tion of transformations to points depends on the "addresses" of the points. Other 
examples include "IFS's with probabilities" which have invariant measures that can 
be computed by means of algorithms of the "Chaos Game" type, wherein, instead 
of each transformation being applied with a fixed probability, there are different 
probabilities that depend on which transformation was applied previously. A unified 
framework for presenting IFS, recurrent IFS, and future developments in the theory 
of deterministic fractals is provided by the idea of fractal systems. 

In this section we introduce fractal systems, their associated fractals, and the 
objects or models which they can be used to approximate. We do this in such a 
way as to help the reader understand how to invent a fractal system appropriate to 
the type of model she or he is describing, without having first to develop a new 
theory from the bottom up. The ideas are loosely stated in order to convey most 
simply the underlying principles, which seem to be of general applicability and to 
be very powerful. From this author's experience, it seems that any framework for a 
fractal system, once decided upon, can be readily fleshed out with the appropriate 
definitions and logical infrastructure to provide a full and clean theory. 

Roughly speaking, a fractal system consists of a means for generating determin- 
istic fractals, associated with an underlying space X, and a set or space of objects or 



379 



380 Chapter X Recurrent Iterated Function Systems 

models, Y, which these fractals can be used to approximate. Throughout this chap- 
ter we use □ to denote a closed bounded subset of IR 2 such as {{x, y) e IR 2 : 0 < x, 

?<!}• 

Already in this book we have looked at several very different fractal systems, as 
described in the following examples. 

Example (i). One example of a fractal system is defined by IFS's of contractive 
affine transformations acting on X = □ c IR 2 : the underlying space is □; the fractals 
of the system are attractors of all of the IFS's; and the space of models, Y, which 
these fractals can be used to approximate, consists of all (compact) subsets of □, 
namely Y = H(X). In this example the models may represent monochrome pictures, 
with the sets representing the black parts of the pictures, and their complements 
representing the (background) white parts. The fractal system itself is a device for 
approximating the models by means of the fractals that it generates. 

Example (ii). Another example of a fractal system is defined by the IFS's of 
contractive affine transformations on □ c IR 2 , with probabilities. The fractals of this 
system are the attractors of the associated Markov operators, as described in Chapter 
IX. They lie in the space Y = V(D), the set of all normalized Borel measures on 
□ c IR 2 . Here the underlying space X is □ c IR 2 ; its points do not belong to the 
fractals of the system because they are measures, not sets of points. In this example 
the models may represent grayscale pictures, with the amount of light reflected by a 
subset of the picture represented by the measure of that subset. Again, the fractal 
system provides a means for approximating the models, using the fractals that it 
generates. 

Example (iii). A third example of a fractal system involves the space X = 
[0, 1] C IR 2 , with Y = C[0, 1], the continuous real valued functions on [0, 1]. In this 
system the fractals are the fractal interpolation functions on [0, 1] that can be gener- 
ated by affine transformations, as described in Chapter VI; and the the models may 
represent time series data or stock market prices. 

Further examples of fractal systems are introduced in this chapter, and involve 
recurrent iterated function systems. Yet others, involving local iterated function sys- 
tems and appropriate to the modelling of real world images, are described in [BH]. 1 
All of these fractal systems use the same basic ingredients, which we describe next. 

(1) An underlying metric space (X, d) needed to define the fractals and the mod- 
els in the system. For example, X may be IR 2 , IR 3 , or a subset of one of these 
spaces, such as □ c IR 2 . Typically (X, d) is complete, and bounded subsets of 
it are compact. 

(2) A space of models Y = Y(X). Each point of Y is a model, and models are de- 
fined with the aid of the space X. The fractals generated by the fractal system 



1 M. F. Barnsley and L. P. Hurd, Fractal Image Compression, (1993), A. K. Peters, Welles- 
ley, Massachusetts. 
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also belong to Y. Examples of Y include spaces of sets, function spaces, and 
measure spaces. We also need a metric h on the space Y, such that (Y, h) is a 
complete metric space. Examples of (Y, h) include (ft (X), the Hausdorff dis- 
tance generated by d), (C[0, 1], h(/, g) = max{|/(x) - g(x)\ : x e [0, 1]}), 
and CP(X), d h ) where d h is the Hutchinson distance between measures. 
(3) A contractive operator O which acts on the space (Y, h). That is, the operator 
O is such that there exists a real number s with 0 < s < 1, and 
h(O(0), O(V0) < s • h(0, f) for all c/),feY. 
Typically, the operator O is constructed with the aid of elementary contractive 
functions that act upon the underlying space X. 
For the examples of fractal systems, (i), (ii), and (iii) begun above, the operators 
0 are set up as follows: 

Example (i) (continued). X = □ C K 2 , w, ; : □ □ is a contractive transfor- 
mation for each / = 1, 2, . . . , N, Y = ft(X), and O : Y Y is denned by 

O(Y) = W(Y) = {J{wi(x) : x e Y}. 
Remark that O is built up using contractive transformations acting on the underlying 
space. 

Example (ii) (continued). X = □ c K 2 , {X; w u • • • , "^n; pu ■ ■ ■ , Pn} are 

IFS's with probabilities, as in section 6 in Chapter IX, with the w.-'s as in (i) above, 
and Y = 7>(X); we can then define a corresponding fractal system by taking O to 
be the associated Markov operator as defined in section 6 in Chapter IX; that is, 
O : Y Y is defined by 

O(v) = M{y) - p\v o Wj" 1 + piv o W2 1 H H ° 

We know from the theory of Chapter IX that O is contractive in the Hutchinson 
metric. Again, remark that O is built up using contractive transformations, the w,\ 
acting on the underlying space. 

Example (iii) (continued). In this example fractal interpolation functions, as 
described in Chapter VI, are used. X = [0, 1] C R, and Y = T = C[0, 1]. Then O : 
Y Y is the operator T : T T defined in the proof of Theorem 2.2 in Chapter 
VI. In the notation of section 2 in Chapter VI, this is, for f ef, 

0(/) = (Tf)(x) = c n l~\x) + d n f(l-\x)) + f n 
for x G [x„_i , x n ], forn = l,...,N. 
Then O is a contraction mapping with respect to the supremum distance on C[0, 1]. 
Again, notice how the operator O is built up from contractive transformations acting 
on the underlying space. 

In practice, the contractive operator O is constructed from finite collections of 
contractive affine transformations; for example, affine transformations acting in two 
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dimensions. Furthermore, the coefficients of these constituent functions can be re- 
stricted to lie in finite sets, defined for example by rounding. Then O itself can be 
described by finite sets of discrete coefficients, the full set of which represents the 
"code" for the operator O. 

The consequences of having the ingredients 1, 2, and 3 can be summarized in the 
following Theorems and Expections: 

Theorem 1 . 1 Existence Of Attractors. Since O is contractive, and the metric 
space Y is complete, there exists an unique attractor (f) € Y such that 

O(0)=0 
Proof See Theorem 6. 1 in Chapter III. 

Expectation 1.1 Fractal Character of Attractors. We anticipate that 

0 is a fractal: that is, we expect that (f) has a resolution-independent, infinitely- 
magnifiable character. This is because of the contractivity of the functions from 
which O is constructed: the whole invariant set is the same as a sum or union of 
contractions applied to it, and thus it is made of shrunken copies of (parts of) itself. 
Depending on the way in which the contractions act, the focus may be on contrac- 
tivity of various spatial dimensions, and/ or measure theoretic contractivity, and we 
expect that the attractor <p will inherit corresponding fractal characteristics. 

Theorem 1 .2 Computation of Attractors. To compute 4>, we can use the 

fact that if iff eY then the result of repeatedly applying O to \\r converges to the 
attractor <p ; that is, 

lim O on (\Js)=4>. 

n— >oo 

Moreover, if there exists a real constant C such that h(0i, <fe) < Cfor all 0i, <fe € Y, 
then we have the error estimate 

h(O on (f),<p) <s n C. 
Proof See Theorem 6.1 in Chapter III. 

The latter equation tells us that the fixed point, or attractor, can be computed 
by algorithms of the "Photocopy Machine" type. The error estimate allows one to 
predict the number of iterations required to achieve a given accuracy. 

Theorem 1 .3 General Collage Theorem Estimate. The distance between 

y\r € Y and the attractor (f)ofO is bounded by the estimate 

h(0, VO < —r. — -• 

(1-5) 

Proof See Lemma 1 1.1 in Chapter III. 

The set O(V0 is called a collage of\Js, while the distance h(\J/, 0(\J/)) is called the 
corresponding collage error. The theorem says that if we wish to find an operator 
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0 whose attractor 0 is approximately yjr, then we have only to solve the problem 
of choosing O such that application of O to f does not change f very much, as 
discussed in Chapters III and IX. 

This latter simple prescription is of central importance, yet easy to miss. We 
repeat it in slightly different terms. Suppose that we have a fractal system, defined by 
Y(X) and O, where O is a set of operators O as above. We seek an operator OeO 
whose attractor 4> approximates \jr e Y. To do so, we adjust O so that the collage 
error h(^, O(V0) is as small as possible. Then O defines our fractal approximation 
0. To store <p it suffices to store O. In the end, one thinks of the controllable family 
of operators O as the fractal system. 

Whenever we have the ingredients 1, 2, and 3, we have a fractal system. A frac- 
tal system is a mechanism for finding fractal approximations (p for models f e Y. 
Examples (i), (ii), and (iii) above constitute interactively controllable systems for 
fractal modelling applied to diverse problems: in (i), sets of points are approximated 
by fractals made of sets of points; in (ii) measures are approximated by fractal mea- 
sures; and in (iii), continuous functions are approximated by fractals that are graphs 
of continuous functions. In each case the parameters that control the problem are 
embedded in the operator O, which is selected and tailored to produce an appara- 
tus for fractal approximation, a fractal system, appropriate to the modelling problem 
under consideration. 

We note that fractal systems provide deterministic fractals rather than stochastic 
fractals. Once the operator O has been selected, its attractor, the fractal that it de- 
fines, has been fixed once and for all. There is only one fractal associated with each 
operator. These fractals are typically low information content members of a larger 
space, the space of models in which they lie; in much the same way as the rational 
numbers are finite information content members of the information rich space of all 
real numbers. 

In the following sections we develop more elaborate fractal systems, based on 
recurrent IFS. Their introduction was begun in section 1 of this chapter. These gen- 
eralize earlier systems in an elegant manner. They can be used to design machines 
to do elaborate fractal modelling, and illustrate well the design principles enunciated 
above. 



2 Recurrent Iterated Function Systems 

Recurrent iterated function systems were introduced in [BEH], 2 where more infor- 
mation can be found. 



2 M. Barnsley, J. Elton, D. Hardin, Recurrent Iterated Function Systems, Constructive 
Approximation (1989) 5, 3-31. 
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Table X. 1 . Example of a recurrent IFS code. Not all transitions are possible! 





a, 


bi 


Ci 






// 


Pi i 


Pa 


Pi3 


1 


0.5 


0 


0 


0.5 


0 


0 


0.3 


0.7 


0 


2 


0.5 


0 


0 


0.5 


0 


128 


0 


0.6 


0.4 


3 


0.5 


0 


0 


0.5 


128 


128 


0.5 


0 


0.5 



Definition 2.1 A recurrent iterated function system consists of an IFS {X; w h 

w 2 ,..., w N } together with a matrix { Pij e [0, 1] : i, j = 1, 2, . . . , N}, such that (i) 

Pn + Pi2 + Pis + • • • + PiN = 1 for I = 1, 2, .... AT 
and (ii) such that for any i and j, there exists a finite sequence of integers k,l,..., 
m e {1, 2, . . . , N} so that 

PikPki ■•■ Pmj>® 

The transition probability for a certain discrete time Markov process is p u , which 
gives the chance, or probability, of moving from state i to state j, given that the 
process is in state i. Condition (i) says that the process is row stochastic: whichever 
state the system is in, a set of probabilities is available that sum to 1, and that 
describe the possible subsequent states to which the system can transition at the 
next step. Condition (ii) says that if the system is in state i then there is a finite 
probability of reaching state j in a finite number of steps, for any pair of integers 
;,je{l,2, ...,N}. 

An IFS with probabilities provides a simple example of a recurrent IFS. The IFS 
with probabilities 

{X; w\, w 2 , w> N ; p\, pi, • • • , Pn) 

is in many ways the same as the recurrent IFS 

{X; w\, w 2 , ■ • • , wn\ (Pij)} 

when pij = Pj for all /, j e {1, 2, . . . , N}. 

We will say that a recurrent IFS is hyperbolic if the associated IFS is hyperbolic, 
that is, contractive with contractivity factor 0 < s < 1. We restrict attention to hyper- 
bolic recurrent IFS. 

Examples of codes for recurrent IFS's are presented in Tables X.l and X.2. In 
each case the space is X = IR 2 , the transformations are affine, and we use the usual 
notation 



-G)-(: :)(;)*(?)■ 



Figures X.266 and X.267 provide diagrams of the associated Markov processes. 
We say that the system is in state i if the last transformation to be applied was w h 
For the the recurrent IFS in Table X.l it is never possible to apply transformation 



2 Recurrent Iterated Function Systems 385 



Table X.2. Example of a recurrent IFS code. All transitions can occur! 





a. 




Ci 




e t fx 


Pi i 


Pa 


^3 


1 


0.5 


0 


0 


0.5 


0 0 


0.3 


0.6 


0.1 


2 


0.5 


0 


0 


0.5 


0 128 


0.1 


0.5 


0.4 


3 


0.5 


0 


0 


0.5 


128 128 


0.4 


0.4 


0.2 



0.7 0 4 Figure X.266. Markov 




process for the recurrent 
IFS given in Table X.l. 
There is no path from state 
1 to state 3, state 2 to state 
1, or state 3 to state 2. 



0.5 




Figure X.267. Markov 
process for the recurrent 
IFS given in Table X.2. 
0.2 Paths exist between all 
possible states. 



0.4 



number 3 if the system is in state 1 . It is also not possible to make a transition from 
state 2 to state 1. For the recurrent IFS in Table X.2, any transformation can follow 
any other one; that is, the system can transition from any state to any other state, in 
one step. 

A recurrent IFS possesses a unique measure theoretic attractor, [BEH]. This is an 
invariant measure that can be computed by means of a generalized "Chaos Game" 
algorithm. It can also be computed using a wierd "Photocopy Machine." 

Example of the Chaos Game Algorithm for a Recurrent IFS. We 

present this algorithm for the case of the the recurrent IFS codes in Tables X.l and 
X.2. The algorithm yields an orbit {(jc„, y n ) : n = 0, 1, 2, . . .} that, with probability 
one, describes the unique invariant measure associated with the recurrent IFS. The 
relationship between the orbit and the measure is much the same as that described in 
Chapter IX for the case of standard IFS with probabilities. 

(1) Choose a starting point (jc 0 , yo) e R 2 . It is desirable that the starting point lies 
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as close as possible to the attractor. Clearly this lies in [0, 256] x [0, 256]. 
Thus, we might choose (xq, yo) = (0, 0). 

(2) Choose a initial state 5 0 = {1, 2, 3]. Any other state in the set {1, 2, 3] could 
be used. The Markov process associated with the transition matrix (p { j) pos- 
sesses a unique stationary vector {m\,m2,m^) with m, > 0, such that 

3 

Y^rn i p ij =m j , j = l,2, 3. 
1=1 

If one knows this vector, then it is a good idea to choose the initial state to be 
the one corresponding to the largest ray. 

(3) Select s\ e {1, 2, 3] with probability p Soj associated to the choice s\ = j. 

(4) Compute (x u yi) = w Sl (x 0 , y 0 ). 

(5) Select S2 e {1, 2, 3} with probability p Slj associated to the choice s 2 = j. 

(6) Compute (x 2 , y 2 ) = w S2 (x u yi). 

(2n + 1) Select s n e {1,2,3} with probability £> s „_,y associated to the choice 

Sn =j- 

{In + 2) Compute (x n , y n ) = w Sn (x n - U y„_i). 

The result of the above calculation is supposed to be a long orbit 

Q(numits) = {(x n , y n ) : n = 1, 2, .., numits}. 

(With probability one) the value of the invariant measure fi e V([0, 256] x [0, 256]) 
of the recurrent IFS is given by the formula 

number of points in {S Pi O(numits)] 

hm /jl(S) = 

numits-» 00 numits 

for any measureable subset 5cK 2 . 

In Figure X.268 we illustrate this algorithm for the case of the recurrent IFS in 
Table X.l. In Figure X.269 we give the results of a computation, showing a long 
orbit. The invariant measure can be pictured by observing the density of the points. 

In Figure X.270 we illustrate this algorithm for the case of the recurrent IFS in 
Table X.2. In Figure X.271 we give the results of a computation, showing a long 
orbit. Again, one gets some idea of the invariant measure by looking at the density 
of the points. 

In Figure X.272 we show the structure and the attractor for the recurrent IFS in 
Table X.3. This involves four transformations which take □ into itself. The range of 
Wi is the quadrant of □ labeled i , so that the transformations provide a just-touching 
collage of the square. The transition probability P24. is zero, which means that no 
points on the attractor lie in the subquadrant labeled 24. 
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Figure X.268. Chaos 
Game for a recurrent IFS 
in which not all transitions 
are possible. The recurrent 
IFS code is given in Table 
X.l. 



IN STATE 1 


IN STATE 2 


IN STATE 3 


Jump to State 1 
with Prob 0.3 

Jump to State 2 
with Prob 0.7 


Jump to State 2 
with Prob 0.6 
Jump to State 3 
with Prob 0.4 


Jump to State 3 
with Prob 0.5 

Jump to State 1 
with Prob 0.5 




( x wYm) X of points which 
N N ^ land in S 



In the (x,y) plane, when the system jumps to State j, apply the map 
Wj to the current point (x N ,y N ) to obtain (x N+1 ,y N+1 ) = Wj(x N ,y N ). 

Thus produce a dancing point. 



Figure X.273 shows the attractor for the maps in Table X.3, for six different sets 
of transition probabilities. These sets of transition probabilites used were: 



(1) 


Pn 


= 0.3,/?i 2 = 


0.5, p n = 


0.1,Pl4 = 


0.1, 




P2\ 


= 0.1, pn = 


0.5, p 2 3 = 


0.4, p 2 4 = 


0.0, 




P3\ 


= 0.3, P32 = 


0.3, P33 = 


0.2, P34 = 


0.2, 




PA\ 


= 0.25, /?42 = 


= 0.25,/? 43 


= 0.25, P44 = 0.25; 


(2) 


Pn 


= 0.3,/?i2 = 


0.5,/?i3 = 


0.1,P14 = 


0.1, 




Pi\ 


= 0.1, p 2 2 = 


0.5, p 2 3 = 


0.4, p 2 4 = 


0.0, 




P3\ 


= 0.3, P32 = 


0.3, P33 - 


0.2, P34 = 


0.2, 




Pa\ 


= 0.25, p 4 2 -- 


= 0.25,/?43 


= 0.5, P44 


= 0.0; 


(3) 


Pn 


= 0.3,p l2 = 


0.5,p l3 = 


0.1,/?14 = 


0.1, 




Pi\ 


= 0.1, p 2 2 = 


0.5, P 23 = 


0.4, p 2 4 = 


0.0, 




P3\ 


= 0.0, P 32 = 


0.3, P33 = 


0.5, P34 = 


0.2, 




Pa\ 


= 0.0, p 42 = 


i0.5, P43 = 


0.5, pu = 


0.0. 


(4) Pn 


= 0.0, p l2 = 


0.5,pi3 = 


0.2, pu = 


0.3, 




Pi 


= 0.1, p 2 2 = 


0.4, p 2 3 = 


0.4, p 2 4 = 


0.1, 
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Figure X.269. Some 
points on the orbit pro- 
duced by the machinery in 
Figure X.268. 



256 



0 




P3i = 0.25, P32 = 0.25, pas = 0.25, p 34 = 0.25, 

P4l = 0.4, p 42 = 0.4, p 43 = 0. 1 , P44 = 0. 1 . 

(5) p n = 0.25, pi2 = 0.25, p i3 = 0.25, p M = 0.25, 
P2i = 0.25, P22 = 0.25, = 0.25, p M = 0.25, 
P3i = 0.25, P 32 = 0.25, pas = 0.25, p 34 = 0.25, 
p AX = 0.25, p 42 = 0.25, p 43 = 0.25, p u = 0.25. 

(6) p n = 0.0, pi2 = 0.34, p 13 = 0.33, p u = 0.33, 
P2i = 0.25, P22 = 0.25, P23 = 0.25, /? 2 4 = 0.25, 
p 3l = 0.25, p 32 = 0.25, pas = 0.25, p 34 = 0.25, 
p 41 = 0.25, p 42 = 0.25, p 43 = 0.25, p u = 0.25. 

Can you work out which set of the above transition probabilities corresponds to each 
of the images in Figure X.273? 

Example of the Photocopy Machine Algorithm for realizing the in- 
variant measure Of a recurrent IFS. We illustrate this algorithm for the case of 
the IFS code in Table X.l. It is illustrated in Figure X.274. 

The mathematical machine in Figure X.274 corresponds to the recurrent IFS 
in Table X.l, wherein there are three affine transformations w x , w 2 , and tu 3 , and 
wherein the transition probabilities p\ 3 , P21, and p 32 are all zero. The states of 
the system are represented by the three copy units, one of which is labeled IN- 
PUT 1/COPY1; the others are labeled INPUT2/COPY2 and INPUT3/COPY3. Each 
unit has two lens/filter components. The first copy unit is made of two lens/filter 
components; one of these components applies affine transformation w\ to INPUT1, 
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Figure X.270. Ran- 
dom walk generator as- 
sociated with the recurrent 
IFS in Table X.2. 



0.4 

Tic, Tic, Tic, Tic 



IN STATE 1 


IN STATE 2 


IN STATE 3 


Jump to State 1 
with Prob 0.3 

Jump to State 2 
with Prob 0.6 
Jump to State 3 
with Prob 0.1 


Jump to State 1 
with Prob 0.1 

Jump to State 2 
with Prob 0.5 
Jump to State 3 
with Prob 0.4 


Jump to State 1 
with Prob 0.4 

Jump to State 2 
with Prob 0.4 
Jump to State 3 
with Prob 0.2 




X 

Apply map Wj when system jumps to State j. 



attenuates the brightness of the image by factor p n , and directs its output to COPY1; 
the other of these components applies affine transformation w 2 to the input im- 
age, attenuates the brightness of the image by a factor p l2 , and directs its output 
to COPY2. The second copy unit has two lens/filter components; the first applies 
affine transformation w 2 to the input image INPUT2, attenuates the brightness of 
the image by factor p 22 , and directs its output to COPY2; the other of these com- 
ponents applies affine transformation w 3 to the input image INPUT2, attenuates the 
brightness of the image by a factor p 23 , and directs its output to COPY3. The third 
copy unit has two lens/filter components; the first applies affine transformation w 3 to 
the input image INPUT3, attenuates the brightness of the image by factor p-&, and 
directs its output to COPY3; the other of these components applies affine transfor- 
mation w x to the input image INPUT3, attenuates the brightness of the image by a 
factor pn, and directs its output to COPY1. 

It is important to note that the three inputs and the three outputs are set up so that 
they all represent the same subset □ of IR 2 . The range of transformation w x lies in 
COPY1, the range of w 2 lies in COPY2, and the range of w 3 lies in COPY3. Thus, 
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Figure X.271. "Pic- 
ture" of the point-set at- 
tractor corresponding to 
the fractal system repre- 
sented in Figure X.270. 



256 




0 



256 x 



Table X.3. Recurrent IFS code. See Figure X.272. 



Wi 


a, 




Ci 


di 




fi 


Pn 


Pa 




Pn 


1 


0.5 


0 


0 


0.5 


0 


0 


0.3 


0.5 


0.1 


0.1 


2 


0.5 


0 


0 


0.5 


0 


128 


0.1 


0.5 


0.4 


0.0 


3 


0.5 


0 


0 


0.5 


128 


128 


0.3 


0.3 


0.2 


0.2 


4 


0.5 


0 


0 


0.5 


128 


0 


0.25 


0.25 


0.25 


0.25 



in the first copy unit transformation w 2 : □ -> □ in fact takes INPUT 1 to COPY2, 
whilst in the second copy unit it takes INPUT2 to COPY2. 

The images that are projected onto the copy screens are superposed; places where 
images come in from several inputs add up. For example, the total brightness of the 
image in COPY1 is p u times the total brightness of the image in INPUT1, plus p 3l 
times the total brightness of the image in INPUT3. 

Copy images are recorded on "photographic paper" and put back on their respec- 
tive inputs, as indicated in the figure. The system is cycled numerous times until the 
output images cease to change; then symbolically 



OUTPUT1 = p n ■ OUTPUT1 + p 31 • OUTPUT3 
OUTPUT2 = p n OUTPUT1 + p 22 • OUTPUT2 
OUTPUT3 = p 23 ■ OUTPUT2 + p 33 • OUTPUT3. 
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W 




256 



Figure X.272. Four 
affine transformations 
provide a just-touching 
collage of □. A recurrent 
IFS (Table X.3) made of 
these transformations is 
such that p 2 4 = 0. As a 
consequence, the attractor 
has no points in the square 
whose address is 24. 



Finally, the images OUTPUT 1, OUTPUT2 and OUTPUT3 are superposed. Symbol- 
ically, 

ATTRACTOR = OUTPUT 1 + OUTPUT2 + OUTPUT3. 



Color Plate 37 illustrates a measure theoretic attractor of a related Photocopy 
Machine. Color Plate 40 illustrates another way of approaching the computation of 
the invariant measure of a recurrent IFS, using cameras and screens. 
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Figure X.273. All of 

these attractors correspond 
to different choices for the 
transition probabilities in 
Table X.3. 




3 Collage Theorem for Recurrent IFS 

In this section we are concerned with the construction of fractal systems, based on 
recurrent IFS, with an emphasis on point-set attractors, that is, attractors that are sets 
of points rather than measures. Our goal is to describe how a collage theorem and 
associated machinery can be set up to enable fractal modelling using the point-set 
attractors of certain recurrent IFS. 

A recurrent IFS can described in terms of a Markov chain which acts on a code 
space built from the symbols {1,2,..., N}, as illustrated in Figures X.268 and 
X.270. The numbers p tj > 0, £f=i Pij = 1, give the probabilities of transfer among 
the symbols. One can imagine a particle moving from symbol to symbol following 
the discrete time Markov process. The process is defined, strictly speaking, to be 
recurrent if there is a finite probability of being able to move, on the directed graph, 
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Input 1 Input 2 Input 3 Figure X.274. Photo- 




copy Machine for realiz- 
ing the invariant measure 
of the recurrent IFS in 
Table X.l. 



Copies 1 , 2 and 3 are combined (superposed) to create Final Image 



from any given symbol to any given symbol. A good source of information on 
Markov chains is [Feller, Ch. 15]. 3 

We are concerned with the hyperbolic case, namely, 

d(Wi(x), Wi(y)) < sd(x, y) Vi, Vx, y e X 

where d is the distance function on X and 0 < s < 1 . In this case, we know that 
there exists a unique attractive invariant probability measure fi, which describes the 
random walk on X produced for example by the "Chaos Game" algorithm described 
in the previous section. Our focus in this section is on the structure of the support 
of fi, which we call A G X, the attractor of the recurrent IFS. This depends only on 
which pij are nonzero, the connection structure of the Markov chain, and otherwise 
not on the values of the p//s. 

We begin by recalling how the analysis proceeds in the case of IFS theory; this 
provides the simplest framework for explaining the recurrent case. Let (X, d) denote 
a compact metric space X, with distance function d. Let H denote the set of all 
nonempty compact subsets of X. 

Definition 3.1 d(x, B) = min yeB d(x, y) Vx g X, V£ g H. Note that 

BCC4 d(x, C) < d(x, B). (1) 



3 W. Feller, An Introduction to Probability Theory and its Applications (Wiley, London), 
1957. 
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Definition 3.2 d(A, B) = max x€A d(x, B) VA, B e h\ 

Note that this set distance is not symmetric. It has the following properties 

(i) d(A, C) < d(A, B). This follows at once from (1). 

(ii) d(A U B, C) = d(A, C) V d(B, C), where x V y = max{x, y}. This follows 
from the observation that 

d(A UB,C)= max d(x, C) = maxd(x, C) v maxd(x, C). 

xeAUB xeA xeB 

Definition 3.3 For all A, B eH, the Hausdorff distance is defined by 
h(A,B) = d(A,B)vd(B,A). 
We recall that (H,h) is a compact metric space. 
Lemma 3.1 For all A, B, C, D eH, 

h(A UB,CUD)< h(A, C) V h(B, D). 

Proof 

d(A UB,CUD)= d(A, CUD)V d(B, CUD) by (ii) 
< d(A, C) v d(B, D) by(i) 
<h(A, C)vh(fi, D). 

The same argument yields d(C U D, A U B) < d(C, A) v d(B, D) < h(A, C) v 
h(B.D). 

Let {X; wj, j = 1,2, ... , N] be a hyperbolic IFS, with d(wjx, wjy) < sd(x, y) 
Vx, y e X, and 0 < 5 < 1. Define 

by 

W(A) = (Jk; ; (A). 

Theorem 3.1 W :H-+ H is a contraction, with contr activity factor s, with 
respect to the Hausdorff metric; that is, 

h(W(A), W(B)) <sh(A, B) VA, B eH. 
Proof For any A, B eH, 
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h(W(A), W(B)) = hl\Jwj(A),(j Wj(B) 



N 

<\/ h(wj(A),Wj(B)) (by Lemma 3.1) 

N 

= \J{d (wj(A), wj(B)) v d {wj(B), Wj(A))} 
< \/ {sd(A,B) vsd(B, A)} 

= */l(A, fl) 

We leave it as an exercise for the reader to show that the following extension of 
the above result is true. We will need this fact later on. If (Xi,di) and (X 2 , d 2 ) are 
metric spaces, then (Hi, hi) and (H 2 , h 2 ) are the corresponding spaces of compact 
nonempty subsets, if 



obeys 



then 



defined by 



obeys 



Wj\Xi^X 2 for 7 = 1,2, ...,7V, 
d 2 (wjx, wjy) <sdi(x,y) Vx,yeX u 
W-.Hi^Hi 



W(A) = (jwj(A) 



h 2 (W(A), W(B))<shi(A, B). 

Corollary 3.1 There is a unique set A eH such that W(A) — A. 
A is the attractor of the IFS. 

Corollary 3.2 Collage Theorem for IFS. IfBeH obeys 

h(B, W(B)) <e >0 

then 

h(B,A)<--^-, 

(1-5) 

where A denotes the attractor of the IFS. 
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Proof By the contraction mapping theorem 

h(A, B) = h(B, lim W on (B)) = lim h (B, W on (B)) 

\ n-+oo / n->-oo 

where W o0 (B) = W(B) and we inductively define 

W o(n+1) (B) = W(W on (B)), n = 0, 1, 2, . . . 
But, by the triangle inequality, 

n 

h (B, W on (B)) <J2 h ( w ° (m ' 1} ( B ^ W om (B)) 

n 

= J2 h (W o(m ~ l) (B), W o(m - l \W(B))) 

m=\ 
n 

< ^s m ~ l h(B, W(B)) 

m=\ 

< (1 -sy l h(B, W(B)) 

We are now ready to extend the structure to the case of recurrent IFS. We actually 
make a generalization of the recurrent IFS structure to multiple spaces and set maps, 
suitable for the hyperbolic case where one is concerned with point-set topology is- 
sues. We are only concerned here with the connection structure of the chain. Let 
(Xj, dj) be compact metric spaces, j e {1,2, . . . , N}. Let (Hj, hj) denote the asso- 
ciated metric spaces of non-empty compact subsets which use the Hausdorff metrics. 
Let there be denned maps Wij : Hj HiV(i, j) e I where / is some set of pairs of 
indices with the property that for each i e {1,2, . . . , N} there is a j e {1, 2, . . . , N} 
with (j, j) e /. That is, = {j\(i, j) e 1} / 0 for each i e {1,2, . . . , N}. Further- 
more let 

hi (Wij(A), Wij(B)) < Sij hj(A, B) 

for some number s, 7 , V(i, j) e I, VA, B e Hj. By the remark following Theorem 3.1 
such maps can be built up from point maps taking Xj to X,. 

Suppose w, r : X X are contractive maps, where X is compact metric, and (p i} ) 
is row stochastic. Define (Xj,dj) = (X, d) for each j, and define Wij(S) = {wi(x) : 
x e S}, i, j = I, . . . , N . Let = {j : p }i > 0}. This embeds us in the more gen- 
eral setup that we now study. 

Let 

H = Hi x H 2 x H 3 x ••• x H N , 

and endow H with the metric h defined by 

h ((Ai, A 2 , . . . , A N ), (Bi, B 2 ,..., B N )) = max [hj{Aj, Bj)\j = 1,2,...,^). 

Then it is readily demonstrated by the enthusiatic reader that (H, h) is a compact 
metric space. 
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We think of H as consisting of a stack of clipped planes X\,X 2 , . . . ,Xn with a 
point in H being the N -tuple of one image in each plane. This idea is illustrated in 
Color Plate 41. Define 



by 



W(A U A 2 ,...,A N )=( (J w Xj (Aj), (J w 2j (Aj),..., (J w Nj (Aj)\ 

\j€l(l) j€l(2) jel(N) I 

For example, in the case N = 2 such a mapping might be symbolized 

-/AA/ 0 W 12 WA,\/ W 12 (A 2 ) \ 
V A 2 J ~ V W 2 i W 22 )\A 2 ) \ W21 ( AO U W 22 (A 2 ) ; 

We have now arrived at our goal; we are able to characterize W as a contraction 
mapping on H; we thus have all of the ingredients of a fractal system. 

Theorem 3.2 w :H-+H obeys 

h (W(A), < sh(A, B) VA, B eH, 

where s = max{s,- ; , (/, j) e /}. 

Proof To keep the notation succinct we suppose 

~ /W n W l2 \ 
\W 2l W 22 ) 

Then if A = (Ai, A 2 ) and B = (B\, B 2 ) we have 
h (W(A), W(B)J 

= h ((Wn(Ai) U W n (A 2 ), W 2l (A\) U W 22 (A 2 )) , 

(W„(Bi) U W 12 .(fi 2 ), W 2 i(Bi) U W 22 (fi 2 ))) 
= max{/ii(Wii(Ai) U W 12 (A 2 ), Wn(fli) U W 12 (fi 2 )), 

M^i(Ai) UW 22 (A 2 ), W 21 (fii) U W 22 (fi 2 ))} 

< max{MWn(Ai), W n (fli)) v /ii(Wi 2 (A 2 ), Wi 2 (fi 2 )), 
MW 2 i(Ai), W 2 i(fli)) v /i 2 (W 22 (A 2 ), W 22 (fi 2 ))} (by Lemma 3.1) 

< max{5n/ii(Ai, fii) v s l2 h 2 (A 2 , B 2 ), s 2i hi(A u fii) v s 22 h 2 (A 2 , B 2 )} 

< s(hi(A u Bx) v h 2 (A 2 , B 2 ))=sh((A u A 2 ), (B u B 2 )) 
= sh(A, B). 

The fact that W is a contraction mapping on ft means that we have a fractal 
system as described in the first section of this chapter. Conditions 1, 2, and 3, of 
section 1 apply. In this case the underlying space is X. Multiple copies of X are 
used to make up a space of models Y = H. (It may be that we only care about one 
"plane" in H.) A contraction mapping 0=WonWis built up out of elementary 
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transformations acting on X. Fixed points of such operators O provide the fractal 
approximations to the models. In the present case the specific structure is as follows. 
Corollary 3.3 When s < 1 there is a unique element 
A = (A l ,A 2 ,...,A N )€H 

such that 

Ai = Ujej^WijiAj) for i = 1, 2, . . . , N. 
That is to say, W (A) = A. A is the attractor of the recurrent IFS. 

Corollary 3.4 A Collage Theorem for recurrent IFS. IfBeH obeys 

h (b, W(B)^ < € > 0 

then 

h(B,A)<-^— 
(1 - s) 

where A denotes the attractor of the recurrent IFS. 

To connect this with the original single-space point-map recurrent IFS, we have 
Corollary 3.5 Let {X; Wj, (Pij), i, j = 1, . . . , N} be a recurrent IFS with X 
compact and the Wi's uniform contractions. Let A be the support of the unique sta- 
tionary measure [i mentioned in section 2. Then there exist unique compact sets 
A; C A, i = 1, . . . , N with A = U^A,- such that 

Ai = U j:pji> oWi(Aj), i = 1, . . . , N. 

In terms of the random walk produced by the Chaos Game Algorithm, as illus- 
trated in section 2, the A,'s may be characterized as follows: for all x, x € A, iff for 
every neighborhood G of x, for almost all trajectories x 0 , if,,(jco), ^ 2 ( w /i(^o))» 
we have i n = i and w in (. . . (w^xo) . . .) € G for infinitely many n. In other words, to 
"see" A, , just look at the points along a trajectory that end in map it>, . 

Let us use the above ideas to construct a simple machine (fractal system) for 
modelling some binary subsets of □. The machine is illustrated in Figure X.275. It 
uses affine transformations w l ■ : □ -> □, i — 1 , 2, 3. Place a copy of the subset T CD 
on (1), and place adjustable subsets U, V C □ on (2) and (3) respectively. Adjust the 
coefficients a t , b t , c, , di, <?, , /, and the two subsets U and V, so that 

T ~ wi(T) U w 2 (U) U w 3 (V) 
U ~ wi(T) U w 2 (U) 
V ~ wi(T)Uw 3 (V) 

Assuming that the system can be adjusted so that these equations hold approxi- 
mately, while the transformations remain suitably contractive, we know that the at- 
tractor for the system, our fractal model, which will be represented by a vector of 
sets (f,U,V), will be such that f ~ T. This simple machine generates an 18 pa- 
rameter family of fractal models for subsets of □. 



3 Collage Theorem for Recurrent IFS 399 




Another simple point-set fractal system is suggested by the diagram in Figure 
X.276. This type of machine generates recurrent fractal interpolation functions. The 
coefficients of the transformations are constrained to make sure that the resulting 
attractors are graphs of continuous functions, as follows: 

Wl (x 0 , y 0 ) = (jc 0 , yo), w 3 (x 2 ,y 2 ) = (x 3 ,y 3 ), w 5 (x 2 , y 2 ) = (*4, yd 
w l (x 2 ,y 2 ) = (x u yi), w 3 (x 0 ,y 0 ) = (x 2 ,y 2 ), w 5 (x 4 , y 4 ) = (x 5 , ys) 
w 2 (x 2 ,y 2 ) = (x u yi), w*(x 4 , y 4 ) = (x 3 , y 3 ), w 6 (x A , y A ) = (x 5 , y 5 ) 
w 2 (x 4 ,y 4 ) = (x 2 ,y 2 ), W4(x 6 ,y 6 ) = (x4,y4), w 6 (x 6 , y 6 ) = (x 6 , y 6 ). 
One example of a commercial fractal system, useful for educational purposes and 
for desktop publishing applications, is the Desktop Fractal Design System, [DFDS]. 4 
This is based on simple recurrent IFS theory using affine transformations in two di- 
mensions. This can be used to illustrate some aspects of modelling using point-set 
attractors of recurrent IFS built up of affine transformations acting in two dimen- 
sions. 



4 The Desktop Fractal Design System, Version 2.0, for Macintosh and IBM PC. Published 
by Academic Press, 1992. 
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Figure X.276. A frac- 
tal system constrained 
so that its attractors are 
graphs of continuous func- 
tions. 




Consider the IFS for a fern, as introduced in Chapter III. We refer to this as Fern 
1. Its IFS code is given in Table X.4 and it is shown in Figure X.277. 

In Figure X.278 we illustrate how one can create the same picture of Fern 1 by 
using a duplicate copy of the space □ with the same IFS acting on that space, and 
allowing a transformation to map from one copy of the space to the other. 

Look closely at the structure of Fern 1 . Notice how the main fronds alternate as 
we go up the main stem. As we saw in Chapter III, this same structure is repeated 
at all scales. If we zoom in on the tiniest frond, we will find that all the frondlets 
alternate up their respective stems. 

Now suppose we modify our more complex recurrent system by adjusting the IFS 
for the first copy of the fern, that is we adjust the affine coefficients so that Fern 1 on 
the first copy of □ becomes Fern 2, as illustrated in Figure X.279 and Table X.5. 

Notice that the fronds in Fern 2 are opposed instead of alternating. Finally, in 
Figure X.280, we show the effect of the modifications on the attractor as a whole. 
The original copy of the fern now has opposed fronds, some of which are made of 



f 



3 Collage Theorem for Recurrent IFS 401 



|Colors 
IPolys 



Condense 



Library 



0,0 



Map 1 



Map 4 



Fern 1 




-J^Map 3 
256,160 



Currently Selected IFS: 1 



Currently Selected Hap: None 



Figure X.277. A bi- 
nary fern image is the 
point-set attractor for a 
single fractal system con- 
sisting of an IFS of affine 
maps acting in 1R 2 . It is 
closely related to more 
complex fractals based on 
recurrent IFS theory. 




Figure X.278. This 
shows how the attractor 
in Figure X.277 is also 
the attractor of a recurrent 
fractal system, involving 
two copies of □ instead 
of one. The attractor of 
the IFS appears as part of 
the attractor of a recurrent 
IFS. 



Currently Selected IFS: 2 
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Figure X.279. The IFS 

acting on the first copy of 
□ is modified to create a 
new fern, Fern 2. 
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Figure X.280. The 

result of the modification 
of the first copy of the fern 
is to fundamentally alter 
the structure of the part of 
the attractor appearing on 
the first copy of Q 
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Table X.4. IFS code for Fern 1. 



w 


a 


b 


c 


d 


e 


/ 


1 


-0.02 


0.39 


-0.31 


0.30 


yt> 


1 no 


2 


0.68 


0.17 


-0.17 


0.68 


73 


11 


3 


0.05 


-0.80 


-0.22 


-0.19 


286 


131 


4 


0.02 


-0.36 


0.00 


0.38 


160 


97 


Table X.5. 


IFS code for Fern 2. 






w 


a 


b 


c 


d 


e 


/ 


1 


-0.02 


0.39 


-0.31 


0.30 


64 


116 


2 


0.64 


0.16 


-0.16 


0.64 


67 


15 


3 


0.00 


-0.80 


-0.22 


0.01 


237 


147 


4 


0.20 


-0.33 


0.00 


0.35 


144 


96 



smaller ferns with alternating fronds. We are learning to control a recurrent fractal 
model! 

Finally what happens if we feed the first copy back into some of the fronds on the 
second copy? The results of such an adjustment are shown in Figure X.281. In fact, 
in this example, the bottom-most fronds on each fern are affine transformations of 
the other fern. 

Color Plate 36 shows a fern image, and zooms on it, produced by a fractal system. 
In this case the fractal system involves four copies of □ and a recurrent structure of 
affine transformations of the type discussed in this section. In this section each of 
three successive zooms on one of the ferns reveals new structure. 

Another example of an attractor, consisting of four point-sets at once, in shown in 
Figure X.282. It is good to think of the vector of sets as a single entity. 



4 Fractal Systems with Vectors of Measures as Their Attractors 

In this section we are concerned with the construction of fractal systems, using 
recurrent IFS, whose attractors are vectors of measures. Our goal is to show how the 
collage theorem and associated machinery can be set up to enable fractal modelling 
using the measure theoretic attractors of certain recurrent IFS. That is, recurrent IFS 
can be used to make fractal systems suitable for modelling measures. 

Recall what happens in the case of an IFS with probabilities: for example, sup- 
pose X = □ c {X; ipi, w N ; p\, . . . , p N } is an IFS with probabilities, as in 
section 6 in Chapter IX, with the u>, 's contractive affine, and Y = V(X); then we 
can define a corresponding fractal system by taking O to be the associated Markov 
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Figure X.281. A frac- 
tal system, based on a 
recurrent IFS structure, 
produces two ferns whose 
structures are woven to- 
gether. Neither could exist 
without the other! 




operator as defined in section 6 in Chapter IX; that is O : Y -* Y is denned by 

O(v) = M(v) = p\v o w\ l + p 2 v o Wj 1 H h Pnv ° u;^ 1 . 

Then we know from the theory of Chapter IX that O is contractive in the Hutchinson 
metric. 

What is the analagous structure in the case of recurrent IFS? We use the notation 
of the previous section, but restrict attention to the case of measures supported on 
three copies of X. Instead of using the measure space V(X), we work with the space 
of normalized Borel measures V that are defined on Borel subsets 5 of X x X x X 
of the special form 
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Figure X.282. Affine 

maps, linking four copies 
of □, lead to a recurrent 
structure consisting of 
four linked sets. Adjust- 
ments of the coefficients 
lead to a wide variety of 
fractal modes in this frac- 
tal system. 



S = (S u S 2 , S 3 ) e B(X) x B(X) x B(X). 

where B(X) denotes the Borel subsets of X. A point in V is described by a vector of 
measures 

p, = (am, fl2, fl3), 
where each /x,- is a Borel measure on X, and 

-■1 f ii(X) + f i 2 (X) + fi3(X) = l. 
Then the measure of vector of subsets S = (S\, S 2 , S3) is 
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A(5) = /xi(Si) + ti 2 (S 2 ) + M3(5 3 ). 
We define a generalized Hutchinson metric d on V by 

v) = sup {]£ (j[ /< ~ J % fi dv ) : ~ f = /2 ' /a)GL }' 

for all jl,v efi, where 

L = { / = (/i, / 2 , h) - fi : : X -> 01 continuous, 

and |/,(*) - < </(*, y) V*. ye\,i = l, 2, 3}. 

Then, from the fact that (V(X), d H ) is a compact metric space whenever (X, d) is a 
compact metric space, it follows that (V, d) is a compact metric space too. 
We are now ready to describe generalized Markov operators M, 

M-.V-+V, 

that are associated with recurrent IFS. We illustrate the idea for recurrent IFS such 
as the one in Table X.l, which is the subject of Figures X.268 and X.274. We simply 
define 

Our general expectation is that such operators are contractions on (P, d) when the 
transformations iu,- are contractions. 

We demonstrate that M has contractivity factor s when p t j — pj, and the w, : 's are 
contractive with contractivity factor 0 < s < 1. We will need the identity 

jf fjd (m(v);) = PV J x fj ( w J (x) ) dVi(x) > 
which follows from a change of variables. Then 
d (M(v), Af(£)) 

(Jjfjd(M(P.)j)) - jf fjd(M(v)j)j : / G L , 

= SU P I / J2Pijfj( W j( X ^ d ^i ~ / ]C Pijfj( W j( X » dv i : f E L ' 

[AY A «7 

= sup - J^Pjfj(Wj(x))d Vi : / G lJ 

Let, for jc G X, 



= sup 
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f(x)=^- 1 ^p j fj(Wj(x)),s- l ^Pjfj(Wj(x)),s l Y,Pjfj( W + x)) ^ ■ 
Then / e L. Let 

L = {/ g L : / is of the special form of / above, for some /eL). 
Then we can write 

~d{M{v), iif(A)) = sup j Jdjx-s j /^:/glJ. 

Since L c L, it follows that 

d(M(v),M(jl))<sd(v,jX). 

This completes the demonstration of the contractivity of the recurrent "multilevel" 
Markov operator, in the special case pij = pj. 

However, in the general case where the transition matrix P = is not con- 
strained as above, the contractivity of the Markov operator is controlled not only by 
the contractivities of the iu/'s but also by the behavior of the transition matrix. To 
understand this, consider what happens when the space X consists of a single point. 
Then V becomes the set 

[jX = Gui, (i2, A*3) el 3 :/ii + ^2 + W=Ui> 0, ii 2 > 0, A*3 > 0}. 
That is, a typical measure £ e V is described by a vector of length three and total 
mass one. The Hutchinson distance between two such vectors is simply the sum of 
the absolute values of the differences between the components as follows: 

d(ji = 0*i, A*2, A*3), v = (vi, v 2 , i*0) = \fii ~ vi| + |A*2 - v 2 | + lA*3 - vsl- 
Then the Markov operator M : V -> P is given by 

= 0*1, A*2, V?,)P = OuPll + /*2P21 + /*3P31, PtlPl2 + A*2/?22 + A*3/?32, 
Ati/?13 + A*2/?23 + A*3/?33), 



where 



It follows that 



P = 



Pll Pl2 Pl3 
P21 P22 P23 
_ P?>\ P32 P33 _ 



d(M(v),M(jD^ <5(v,£). 

However we cannot, in the general case, assert a contractivity factor smaller than 
one, as can be seen by considering the considering the case 

1 - 2e e e 
e 1 - 2e e 
€ € 1 - 2e 
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where e is very small and positive. For this transition matrix, we find that 

d (M((l, 0, 0)), M((0, 0, 1))) = (1 - 3€)d ((1, 0, 0), (0, 0, 1)) . 

However, so long as the Markov operator has a unique fixed point, there always 
exist metrics such that the operator is contractive, in fact, with an arbitrarily small 
contractivity factor. The problem is the identification of such a metric, one that is 
simultaneously intuitively or visually accessible. 

The operator M : V(X) -> V(X), associated with an IFS with probabilities, 

M(v) = P\v o w^ 1 + piv o w^ 1 H h Pnv o wj, 1 , 

is an example of a measure preserving transformation on a metric space, constructed 
from contractive transformations. 

Given a set of spaces, {X k : k = 1, 2, . . . , N}, we can construct measure preserv- 
ing transformations M, 7 : V(Xi) -> V(Xj), according to 

Mij(v) = p U iV o Wjj\ + p ij2 V OU)-/ 2 + --- + Pij Ni jV o w„*, 

where w ijn :Xi^Xj for n e {1, 2, . . . , My} is contractive between the metric 
spaces (Xi, di) and (Xj, dj) according to 

dj (w ijn (x), w ijn (y)) <sdi(x, y) for all x, y e X t , 
and the /?i 7n 's being positive numbers such that 

Y^Pijn = l- 
n 

(One can think of [X it Xf w ijn , p ijn , n = 1, 2, . . . , AT, 7 } as being an "IFS" that acts 
between the spaces X, and Xj.) We now introduce an N x N transition matrix 
p _ (p.^.) t h a t is row stochastic, and gives the probability of transit! n from space 

Xi to Xj. Then we can define a Markov operator M :V^V, where V = x 
V(X 2 ) x ... x P(X N ),by 

^T(Ati, At 2 , • ■ • , Atw) = (^PnMiiifXi), Y^PiiMuim), ■ ■ . ,^2piNM iN ((n)j . 

In general we expect such operators to be contractive, and to provide the basis for 
useful fractal systems for modelling vectors of measures, and in particular, grayscale 
photographs. An example of such a system is VRIFS™, 5 which is aninteractive image 
modelling system that uses a fractal system built from operators of the type of M 



5 VRIFS™, which stands for Vector Recurrent Iterated Function System, is an interactive 
measure theoretic fractal system for modelling images in two dimensions. It runs on Sun 
workstations and uses four-dimensional vectors of measures under affine control. It is avail- 
able from Iterated Systems Inc., Commercial Division, 5550- A Peachtree Parkway, Norcross, 
Georgia, 30092, USA, telephone number (404) 840 0633. 
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Figure X.283. A spe- 
cial Markov operator acts 
on a vector of measures. 
Its attractor is a stack of 
images, including a split 
root. Fractal systems of 
this type are good for frac- 
tal modelling of grayscale 
images. 



above, with all transformations being two dimensional affine transformations. An 
example of an image generated by VRIFS™ is shown in Figure X.283. See also Color 
Plates 37, 38, and 39. 
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Answers to Chapter II, section 1 

II.1.8 The Riemann sphere is not a vector space because of the action of the 'Point at Infinity'. To be a vector space, each 
point must have an additive inverse, that is, another point which summed to it yields the additive identity element 0. Since 
there is no such inverse for oo, the space is not a vector space. Note that the usual check a(x + y) = ax + ay does hold 
however. 

The examples in 1.7 are not vector spaces for the simple reason that they are not closed under algebraic operations. 
That is, multiply any element in X by a large enough scalar, and it will not be in the space. The example in 1.6 has the 
same problem. If one defines multiplication by scalars in codespace in a reasonable way, namely multiply the distances by 
multiplying the codes and 'carrying' one will find a sequence for which there are no codes. (Multiply x = (N — \)(N - 
l)--by2). 

H.1.9 By definition {x} C X if every element in {x} is in X. The only element in {x} is x which by assumption is in X. 
Hence jx}cX. 

Answers to Chapter II, section 2 

H.2.1 We use extensively the inequality \x\ + \y\ > \x + y\. It is proven by separating it into three cases: a) x, y > 0. In this 
case | jc | = x, \y\ = y, and since the sum of two positive numbers is positive, \x + y\ = x + y = \x\ + \y\. b) x, y > 0. In this 
case |;c| = -x, \y\ — -y. Since the sum of two negative numbers is negative, \x + y\ — —x - y = \x\ + \y\. c) x > 0, y < 0. 
By changing the roles of x , y this proves the other possibility as well. \x\ =x, \y\ = -y. \x + y\ = \x - (-y)\ = \\x\ - \y\\. 
This latter number is either equal to \x \ - \y \ or its negative, and both are less than or equal to \x \ + \y \ with equality only 
if one or the other are zero. Hence \x + y\ < \x\ + \y\. We proceed to verify the metric space hypotheses: 
(a) 

(i) d(x, y) = \x - y\ = \-(x - y)\ = \y -x\ =d(y, x). 

(ii) x - y # 0 unless x = y. It is not infinite if x, y are not infinite. Since taking the absolute value does not alter either 
of these properties 0 < d(x, y) < oo. 

(iii) d(x,x) = \x-x\ = |0| =0. 

(iv) d(x, y) - \x - y\ = \x - z + z - y\ < \x - z\ + \z - y\by means of the proof above. 

(b) (i), (ii), (iii) can be verified by multiplying everything in part a) by 2. (iv) d(x , y) = 2\x - y\ = 2\x - z + z - y \ < 
2(\x-z\ + \z-y\ = 2\x-z\+2\z-y\. 

(c) All properties can be done by a) substituting x 3 , y 3 , z 3 everywhere in the above arguments. 

11.2.3 d(x, y) = |jry| is not a metric for OL Choose x = 0 and y # 0. Then x # y, but d(x, y) = |0 • y\ = 0, which violates 
property (i). 

11.2.4 On the punctured plane [R 2 \ O define d(x, y) = |r, - r 2 | + 9. Then for two points x, y, we have 

(1) d(x, v) = |r, - r 2 + 9 \ = |r 2 - r,| + 9 = d(y, x) since 9 is still the smallest angle subtended by the two rays. 

(2) If x ^ y and |r, - r 2 | = 0 then 9 # 0 so d(x, y) > 0 if x # y. 9 < tt so that for d(x, y) - oo we must have either r, 
or r 2 infinite. But r, = x] + x\ and r 2 = y\ + y\ so if either is infinite so is either x or y. Hence 0 < d(x, y) < oo if 
x 7^ y. 
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(3) d(x, x) = \r - r\ + 0 = 0 + 0 = 0. 

(4) d(x, z) + d(z, y) — In — r 3 | + |r 3 — r 2 \ + 0\ + 0 2 . 0\ + 0 2 > 0 if the ray to z is not between, the other two, and 
0, + 0 2 = 0 otherwise, so d(x, z) + d(z, y) > |r, - r 3 | + |r 3 - r 2 \ + 6 > |r t - r 3 + r 3 - r 2 | + 0 — \r x - r 2 \ + 0 = 
d(x, y) by problem 1 above. 

11.2.5 If x, y are not equal, then there is an jc,, y t pair which do not agree. Hence |x, - yi\/(N + 1)' ^ 0. Since all the 
terms in the sum are positive or zero, there is no cancelling, and therefore d(x, y) ^ 0. We can switch the order in each term 
because of the absolute value signs, consequently d(x, y) = d(y, x). The series converges since the numerators are less than 
2Af so the series is dominated by 

OO 1 

2N-Y — 

(N + 1)< 

which converges for all N < oo. Because the series converges, the proof of all the metric requirements, except the above, 
follow from problem on each term, and therefore for the whole series. 

11.2.6 The easiest way to show that this equation for a distance measure satisfies metric properties turns out to be that we 
show that it really is a Euclidean metric, in the usual sense. Let x be an element of E the code space on N symbols. Then 
the series 



Xi 



converges to a number in [0, 1]. In fact it is the (N + l)-ary expansion of a real number in this range, having the property 
that it has no digits equal to N, because the code space symbols range from 0, . . . , N — 1. For instance, when N = 9 we 
have 



Xi X\ x 2 

— = — H ~r ■ ■ ■ = •X\X 2 x~i . 

10' 10 100 



Then the mapping £ -> [0, 1] given by 



=£ 



Xi 



(n + iy 



is a metric equivalence, with constants c\, c 2 in the definition equal to 1. Using this map, let (x,y),(u, v) be two points 
in the space of shifts on N symbols. By the above equivalence, we have corresponding points (x, y), (u, v) in the square 
[0, 1] x [0, 1] C Since the series converge, we have 

v Xj v U, (X, Uj) 

X ~ U ~ (N + 1)' _ (N + 1)'' _ ^ (N + 1)'' 
The metric in the statement of the problem can therefore be written as 



d((x, y), (u, v)) = y/(x - u) 2 + (y- v) 2 
which is Euclidean distance in IR 2 , and hence a metric. 

11.2.7 In body space, the shortest distance between two points is the length of the path through the body. The shortest 
distance from a point to itself is then 0. The shortest distance between two points is the length of the path which is also the 
shortest path in reverse order. The length of a path between two distinct points cannot be zero, and a path of infinite length 
inside the body must not be the shortest path between two points. Finally the shortest path between two points in the body 
is less than the path which takes a detour to a third point unless that point is on the shortest path already. Hence d(x, y) is a 
metric. A 

11.2.8 For a function d which is not a metric, see for example exercise 2.3. For the annulus to look like a cylinder, we must 
have the distance around any concentric path look the same. This implies that we need the distance around all concentric 
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paths equal. Use the angle subtended between two points to do this. To look like height, we need a linear rad ial distance, use 
the radius. If we want to look like the surface of a cylinder with Euclidean metric (unrolled), we thus use V (r x - r 2 ) 2 + 9 2 . 
The proof that this is a metric follows exercise 2.4. 

11.2.9 That a distance measure formed by great circle distance is a metric is true on any two-sphere. We verify the conditions 
as follows: 

(i) The plane defined by the center C of the sphere and any two distinct points x,y on its surface cuts the sphere on the 
great circle path between the two points (that is its definition), so there is one and only one great circle distance between the 
two points. This distance is r9 where 9 is the interior angle xCy. Hence d(x, y) = 0 if and only if 6 = 0 that is x = y. 

(ii) Because we are dealing with interior angles, the angle xCy and the angle yCx are equal hence d(x,y)= d(y, x) > 0. 
The maximum value of an interior angle is it, hence d{x, y) < rit < oo. 

(iii) To prove the triangle inequality, we note first that if x, y, z lie on the same great circle, then if z is interior to 
x,y, d(x, z) + d{z, y) = d(x, y), whereas if it is exterior then since an exterior angle is larger than an interior one, 
d\x] z ) + d(z, y) > d(x, y). Consequently, we deal only with z not on th e great circle defined by x, y. For this case, the 
1R 3 distance between the points x and z is given by yjr\ + r\ - 2r x r 2 cos 6 by the law of cosines. Without loss of generality, 
n = r 2 = 1 and the square root is monotonic increasing, so that this distance is proportional to 1 - cos 6 and hence decreases 
for decreasing 9. Consider the plane triangle in 1R 3 defined by x,y,z. By moving z towards the line xy, we decrease 
the distances between x and z and between z and y (we are moving down the perpendicular from z). Since the above 
law of cosines formula insists that these distances can only decrease if the corresponding angles xCz and yCz decrease, 
the distances on the sphere are also strictly decreasing. This implies that d(x, z) + d(y, z) > d(x, y) in this case. Hence 
the triangle inequality holds for great circle distance on any sphere. When this sphere is interpreted as C, the great circle 
between oo and any other point is along the projection of vertical lines in the complex plane. Hence since 1 + i is above the 
real axis, its distance to infinity is less than that of zero, which is n/2. 

11.2.10 Suppose that we have 

c\d x {x, y) < d 2 (x, y) < c 2 d x (x, y) 
where c x , c 2 are positive real numbers. Using the first inequality, we have 

d x (x,y) < —d 2 (x,y) 

and the second gives 

— d 2 (x, y) < d x {x, y) 
c 2 

Let e x — — and e 2 = ^, and combine the above two inequalities, we get 

e\d 2 (x, y) <d\(x,y)< e 2 d 2 (x, y) 

as desired. 

11.2.11 Yes in both cases. Let d x (x, y) be Manhattan metric,d 2 C*, y) be Euclidean, and use, for instance c x = 1/2, c 2 = 2. 
Since the Manhattan metric is at most sfl times the Euclidean metric (when they lie on a line at a 45° angle) these constants 
will do fine. 

11.2.12 Two metrics are uniformly equivalent if and only if there are constants c u c 2 such that 

c x d x (x,y) <d 2 (x,y) <c 2 d x {x,y) 

for every pair x, y in the space. To see that the metric d(x, y) = In - r 2 \ + 6 is not equivalent to the Euclidean metric on 
the annulus, choose d x to be the Euclidean metric, and d 2 to be this one. Let x, y be on the real line x < 0 and y > 0. Then 
d 2 (x, y) > 9 = n. Regardless of the value of c 2 by moving x, y towards the origin we can insure that d x (x, y) is less than 
any e by letting x = (-e/3, 0) and y = (e/3, 0). Hence c 2 d x (x, y) < c 2 e. Then if e < c 2 /n the inequality fails. Hence the 
two metrics are not equivalent. 
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11.2.13 Use h(x) = x-l. Then d x (h(x), h(y)) = 2 • \(x - 1), (y - 1)| = 2 • \x - y\. This metric is equivalent to d x {x, y) 
by using C\ — 1/4, c 2 = 1. 

11.2.14 To show that (^Manhattan) and (B,Euclidean) are equivalent metric spaces, use h{x) =x. It is easiest to see 
the equivalence by making the substitution x' = 0, y' = y - x = y x - x x , y 2 - x 2 . Then the Euclidean distance d(x, y) = 

d(0, y') = yjy\ 2 + y 2 = r . The Manhattan distance in the same notation is \r cos 9 + r sin 9 1. Maximizing the last expres- 
sion with respect to 9 yields 9 = n /2 or cos 9 = sin 9 = Vl/2. So the maximum difference in size between the two metrics 
is dm = V2d E . Then 

-^=d M (x, y) < d E (x, y) < d M (x, y). 
V2 

11.2.15 Using h(z) = h{ Z \ + iz 2 ) = (zi, z 2 ), we have d(x, y) = d E (x, y) on 1R 2 . The above problem gives the metric equiv- 
alence for this with the Manhattan metric. 

11.2.16 Suppose they were equivalent. Then there are constants c x , c 2 such that 

c\d\(x, y) < d 2 (x, y) < c 2 d x {x, y) 
Let x — 1 and y = ^py- Then d 2 (x , y) = 2c 2 d\(x, y), a contradiction. Hence they are not equivalent. 

11.2.18 If two metric spaces are equivalent, there is a one-to-one, onto map h : Xi — ► X 2 such that 

c\d](x, y) < d 2 (h(x), h(y)) < c 2 d\(x, y) 

The second inequality implies that if 

d x (x, y) < 8 = e/c 2 then d 2 {h{x), h(y)) < c 2 8 = a. 
Since h is invertible, we may write the first inequality as 

d x (h-\x),h' x {y))<-d 2 {x,y) 

for points x,y e X 2 . Then d x {h~\x), h~ x {y)) < e whenever d 2 (x, y) < 8 - c x e. Hence h and its inverse are continuous, so 
h{x) is a homeomorphism. 

11.2.19 Intuitively, these metrics behave similar to base 10 decimals. In fact, if we look at Ei 0 the symbols are 0-9, and by 
choosing k = N = 10, we get decimal expressions (with some ambiguity). To show that these metrics are topologically 
identical, we must demonstrate that the identity map i(x) =x is a homeomorphism. This is equivalent in epsilon-delta 
format to showing that we need to show that for every x, a > 0 there is a 8 such that for every y in Eyy the two statements ' 

d( x (x,y) < S=>d k2 (x, y) <e 

and, 

dk 2 (x,y) <8=>d k] (x,y) <e 

are true. 

We can simplify matters by breaking the proof into cases and reducing it to a single statement to prove as follows: 
Case 1: ki = k 2 . In this case there is nothing to prove, since a metric is always equivalent to itself. 
Case 2: ki < k 2 . In this case, by comparing terms of the two series, since k\ > k\, d kl (x, y) < d kl (x, y) for any x, y with 
equality holding only for x — y. Consequently, we may choose 8 = e in the first statement: 

d kl (x , y) < € d k2 (x , y) < d kl < € . 

and we need only prove the second statement, namely 

d k2 (x, y) <8=> d kx {x, y) < e 
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Case 3: This case requires only the proof of the first statement for the same reasons as case 2. There is no loss in just 
assuming case 2, and reversing the subscripts for case 3. 
Consequently, all we need to do is show that if < k 2 , 

d kl {x,y) <<5i =>d kl (x,y) < <? 

and then take 8 = min^, e). 

Looking at the analogy between decimals and we can get the distance d k (x, y) arbitrarily small if *, = up to some 
big number i'o. This would correspond in decimal arithmetic to the first i 0 decimal places being equal to zero in the distance 
formula. Writing it without the analogy, since the series is less than 



OO 1 

£-^i hi 



i = l 



then 

y ^f- N __ c 

L-^i fci L—t hi hi 

i=i 0 i=i 0 



k l k l k> 



To insure this is less than, say a we only need to choose i 0 large enough. This is nice, we can get a set we can describe by 
just using the digits that put us inside a given B e (x). What we don't know is whether it contains a ball inside it (For large 
enough N it is a ball, but we don't know that). On the other hand, we can look at the smallest possible distance we could get 
if any of the first io "digits" were not zero. This would be the distance between an x and a y with only the i 0 digits different, 
namely 



and Xi = yi whenever i ^ i 0 . This distance is 



\ x i 0 -yi 0 \ = 1 



dk(x ' y) = lk 



So any ball B s (x) with 8 < l/k'o is guaranteed to have no elements which disagree on the first i 0 places. 
We now proceed as follows: Choose i 0 such that the set 

B = {y : x, ; = yi if i < / 0 } 

is inside of the d k{ ball B f (x). Now choose 8 0 such that the d k2 ball B So (x). Since all elements of this d kl ball have at least i 0 
"digits" of agreement with x, everything in the ball is in the d kl ball of radius €. Therefore, 

d k2 (x, y) <8 0 => d kx {x, y) <€ 

which was all we decided we needed to prove. 

Answers to Chapter II, section 3 

11.3.1 By Definition 3.3, if {*„} is a Cauchy sequence in (X, d) a complete metric space, then it converges to a point x € X. 
This implies that given an € > 0 there is an Af such that if n > N then x n € B(x, e). Since there are infinitely many integers 
greater than any integer Af, x n € B(x, e) for infinitely many integers n. 

11.3.2 We will use the property of (R., Euclidean) that every set bounded above has a Least Upper Bound (LUB). This 
implies that a monotone increasing sequence {/„} which is bounded above has a limit. Since the LUB cannot be less than 
/„ for any n (the sequence is increasing) and there is no minimum distance to all the /„ (since it is the LUB) it follows that 
for any e > 0 there is an Af such that x„ € B(LUB, e) for n > N. We use this property to prove completeness of (K, d) as 
follows: Let e, = 1/2' and Af, be for each i a number such that Af, > N t ^\ and d(x n , x m ) < e, whenever n, m > A^,. It follows 
that there is an interval of length e, such that all the x n are in this interval if n > N t . Let /, be the greatest lower bound 
of the set [x„ : n > N,} (which exists by subtracting everything from a large number and using the LUB principle). The 
sequence {/, } is then monotone increasing, and is bounded above by the LUB of the same set of x„ which we call L, . Then 
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this sequence has a limit which we will call jc. We claim that x is the limit of the Cauchy sequence as well. Given e > 0, 
B(x, € ) D [/,-, Li] for some i, hence if n > N h x n e B(x, e) which means x is the limit of {x n } and the sequence converges. 
Hence (R, Euclidean) is complete. 

11.3.3 We show that R 2 is complete by using the fact that R is complete. Let {(x n , y,,} be a sequence in R 2 which converges 
to (x,y). Then {jc„} jc, {y n } y. The converse is also true in R", since given e > 0 we can find a small rectangle 
B(x, e x ) x 5(jc, €y) C 5((jc, y), e). In turn, we can find N x , N y such that x n € O \in>N x and y„ € 5(y, e y ) if 
n>N y . Choose W = max(JV„ N y ). Then for n > N, (x n , y n ) € y), Q since it is in the small rectangle inside. With 
this in mind, let {(jc„, y„)} be a Cauchy sequence. Then because y/{x n - x m ) 2 + (y„ - y,„) 2 < e implies \x n -x m \<€, \y„ - 
y m | < e, {jc„} and {y„} are each Cauchy sequences in R. By problem 2, these converge, so {(*,,, y„)} converges. Hence R is 
complete. 

11.3.4 There are two ways of approaching this. We can either repeat the arguments in the answers to exercises 3.2 and 3.3 
form, or we can peek ahead to the definition of a closed set. If (X, d) is a complete metric space, the only way a Cauchy 
sequence in a subset can fail to converge in the subset is if it converges to a limit which is not in that subset, because viewed 
as a Cauchy sequence in X, it has a limit. The limit of this sequence is thus a limit point of the subset, and hence if the set 
is closed, it is in the set. So any closed subset of a complete metric space is complete. ■ is a closed subset of R 2 so it is 
complete by problem 3. 

11.3.5 The sphere can be thought of as an embedded closed subset of R 3 . The spherical metric is equivalent to the R 3 
Euclidean metric. To see this, note that the Euclidean metric, used on the sphere gives to each pair of points x, y the length 
of the chord between them. This length, for a sphere of radius 1 is V2-2cos0 (by the law of cosines). Using the constants 
2/n and 1 /n this is a metric equivalence to the distance measure of great circles. If {*, } is a Cauchy sequence on the sphere, 
it is in R 3 as well by this equivalence, consequently it has some (R 3 ) limit x. Since the sphere is a closed subset of R , it 
contains its limit points, and therefore contains jc. Hence the sequence has a limit, and the sphere is complete. 

11.3.6 Let {jc a } be a Cauchy sequence on the codespace S on N symbols. Then given e > 0 there is an M such that if 
a, , o 2 > M then d{x a ^ , jc a2 ) < e. In particular, if we choose e = ^ then since any of the first i terms must either be 0 or 
contribute more than this value, x av x„ 2 must agree on the first i terms. Choose x such that for each i, the first i terms of x 
agree with all those elements of {jc a } with a > M„ where M, is the M corresponding to the above choice of e. Then x e E 
and for each e > 0 there is an M such that d{x, x a ) < e if a > M . In other words x is the limit of {x a }. Hence codespace is 
complete. 

11.3.7 Let {ft) be a Cauchy sequence on (C[0, 1], D). Then for each s e [0, 1], the sequence {f,(s)} is Cauchy in R, since 
given € and N guaranteed by the assumptions, d(fi(s), fj(s)) in the Euclidean metric is certainly less than the maximum 
value over s. We will anticipate a little and call the limit of each such sequence for each s f(s). As defined, f(s) is clearly 
a function on [0, 1], we must show that it is continuous. Given e > 0 there is a 8 > 0 such that if d(s { , s 2 ) < 8 then for any 
i D(fi(si) fi(s 2 )) < e/3. This property, called equicontinuity, is true for bounded functions on a closed interval, such as 
we have. Then there is also an N such that if i > N d(Ms x ), /(s,)) < <?/3 and d(Ms 2 ), f(s 2 )) < 6/3 since the sequence 
converges. Consequently, for our choice of 8, 

d(f(s { ), f(s 2 )) < d(fi(sd, /(*.)) + d(fi(sd, fi(s 2 )) + d(f,(s 2 ), f(s 2 )) < € 

by the triangle inequality, and hence f(s) is continuous. Then / € (C[0, 1], D) and the space is complete. 

11.3.8 Let (Xi.di) and (X 2 ,d 2 ) be equivalent metric spaces. Then there is a homeomorphism h:X\-*X 2 such that 
~d(x, y) = d 2 (h(x), h(y)) is equivalent to d x {x, y) on X,. Let {x n } be a Cauchy sequence in X 2 , and let € > 0, and N large 
enough that d 2 (x n ,x m ) < € whenever n, m > N. There exists a sequence {y„} in X, such that h(y n ) = x„Vn, and by the 
metric equivalence, we have that 

d\(y„, y m ) < c 2 d{y n , y m ) = c 2 d n (x n , x m ) < c 2 € 

Consequently, by specifying 8 = c 2 €, for any € > 0 there is an N such that d 2 (x n , x m ) < 8 which forces d,(y„, y m ) < 
€Vn,m > N, in other words {y„} is a Cauchy sequence in X,. This has a limit y because X, is complete. Since for all 
8 > 0 there is an N such that d,(y, y n ) < 8 for n > N, then since (d)(y, y n ) < (l/c,)d,(y, y„) < (l/c,)5 = e, it follows that 
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d(y, y n ) — d2(h(y), x n ), {x n } — >• h{y) — x and the sequence converges in X 2 . Hence X 2 is complete. In the process, if two 
spaces are metrically equivalent, then a Cauchy sequence in one corresponds to a Cauchy sequence in the other under the 
metric equivalence. 

11.3.9 Given two points x = (xi,x 2 ),y = (ji, y 2 ) in (■.Manhattan), any path which consists of straight horizontal and 
vertical segments, which are monotonic in each coordinate have the same length. Since the sum of these lengths is equal 
to the distance between the two points, they are all shortest paths. 

11.3.10 Suppose {x n } — >• x in the space X. Then given e/2 > 0 there exists an N such that d(x, x n ) < e/2 if n > N. Choose 
two such points x n , x m then by the triangle inequality 

d(x n , x m ) < d(x n , x) + d(x, x m ) < e 

So given e, if n, m > N,d(x n ,x m ) < e, in other words, the sequence is Cauchy. 

11.3.11 Suppose {x„} — >• x and {x„} — >• y. Then given e/2 > 0 there is an Ni such that d(x n , x) < e/2, and an N 2 such that 
d(x n , y) < e/2. Choose N = max{N\, N 2 ). Then forn>N 

d(x, y) < d(x, x n ) + d(x n , y) < e 

Consequently, for any e > 0, d(x, y) < e. In other words, d(x, y) = 0, or x = y. 

11.3.12 Problem is misstated. 0 is the limit regardless, in the second space the sequence does not converge because 0 is not 
in the space. 

11.3.13 Define the sequence {x„} by x n = aVn. This sequence is clearly Cauchy, d(x n , x m ) = 0 < e for any e > 0 and any 
n, m. It also converges since d(a, x n ) = 0 < e for all n and any e > 0. The space contains no limit points since a limit point 
is defined as being the limit of a sequence but not a member of the sequence. Since this is the only sequence in the space, 
the space contains no limit points and therefore it contains all its limit points (there are none) but is not perfect since a is not 
a limit point. 

11.3.14 {x n = n] does not converge in (R, Euclidean). Suppose it did. Then by Theorem 3.1 this is a Cauchy sequence. But 
for any n^m d(x„, x m ) > 1 and thus is not less than any given e > 0 for any N. For (C, spherical), the distance to the 
Point at Infinity is calculated by looking at the tangent of the angle subtended by the real line and the projection through the 
sphere. The radius of the sphere is 1 so this tangent is l/n. Given e > 0 we may choose N such that 1/ N < e, and therefore 
ifn>N d(oo, x n ) = l/n < e. Hence the sequence converges in (C, spherical). 

11.3.15 Let x be a limit point in X t . Then there is a sequence {x n } — >• x, x ^ x n Vn, in X] by definition of a limit point. 
Since h provides a metric equivalence, given e > 0 there is a S > 0 such that if n > N, di(x, x n ) < S = (l/ci)e implies that 
d 2 (h(x), h(x n )) < e. Hence since h is onto {h(x n )} — >• h(x) e X 2 .' Since h is 1-1 and onto, h(x n ) ^ h(x)Vn. Hence h{x) is a 
limit point in X 2 . 

11.3.16 (^Euclidean) is defined as being those points whose x and y coordinates both lie in [0, 1]. The points in this set are 
all greater than 1 in the y coordinate, hence are not in ■. The set, taken in U 2 has two limit points in ■ (it does not converge 
as a sequence). These are at (1, 1) and (—1, 1). 

11.3.17 Any point of the set S is the limit of a sequence which converges because eventually all the terms are the same. This 
type of sequence is always possible (see exercise 3.13), however such convergences do not qualify the point as a limit point 
of the set. The only such point would be 0, since this is the only Cauchy sequence converging to point not in the sequence. 
Since this point is not in the space (0, 1], the set of limit points of S is empty and therefore a subset of S. Hence S is closed 
in (0, 1], Euclidean). 

11.3.18 Let S = [0, 1] C (IR, Euclidean). Let x e S, and let a = min(jc, 1 — x). If a — 0, set a = 1. In other words, a is the 
least distance to an end of the interval. Then the sequence {x n — x + «/2"}^ =1 lies entirely in S. Given e > 0 there is an 
TV such that e > a/2 N . This implies that for n > N, d(x, x n ) =d(x,x — a/2") = a/2" < e, in other words thiSC sequence in 
[0, 1] converges to x, hence x is a limit point. Since x was any point in the set, the set consists entirely of limit points. It is 
closed, and therefore perfect. 

11.3.19 By reference to exercise 3.17, this set contains its only limit point, 0. It is not perfect since for any n there is no 



i 



Selected Answers 423 



Cauchy sequence (and hence no convergent sequence converging to \/n which does not contain \/n. Hence \/n is not a 
limit point, and the set is not perfect. It is closed, since it contains it's limit point. 

11.3.20 That E contains its limit points is true because it is the whole space and is therefore closed. We need only verify 
that each point in E is the limit of a sequence {y n } which does not include it. This is done as follows: Let x = x x x 2 . . . e E 
and suppose we have N symbols. For each n let y n be the point which agrees with x on all symbols except the n th , and is 
equal to the next greater symbol at the n th place unless that symbol in x is N - 1 in which case use N -2. Then for each n 
we have 

!■*»■ ~ y»j\ \x„ - y„J 1 

{x,Xn) ~ (N + \y (N + \r (N + \y 

because there is only one non-zero term in the sum. This sequence is then convergent-given e > 0 there is an M such that 
for n > M d(x, y n ) = 1 /(N + 1)" < 6. No term in the sequence is equal to x because it disagrees at the n lh term. Since x is 
arbitrary, every point in E is a limit point and E is a perfect set. 

11.3.21 If (5, d) is complete, then given a Cauchy sequence in 5, it converges to a point in 5. Since every limit point is the 
limit of a convergent sequence in 5, and hence a Cauchy sequence in 5, 5 contains its limit points and is therefore closed. 
Suppose 5 is closed, and let {x n } be a Cauchy sequence in 5. Then {x„} is a Cauchy sequence in X as well and there is a 
point x 6 X such that {x n } ->• x, because X is complete. Hence x is a limit point for 5. Since 5 is closed, x e 5. 



Answers to Chapter II, section 4 

11.4.1 In order for X to be closed, it must contain all its limit points. Thus let {x n } be a convergent sequence in X. By 
definition, there is an x e X such that {x n } x. Hence X is closed. A Cauchy sequence need not converge however. Let 
X C Y, such that Y is complete. Then if {x n } y & X, then this sequence does not converge in X, and y is not a limit point 
in X. The sequence is nevertheless Cauchy, and since it does not converge, X is not a complete metric space. An example is 
X = (0,1). 

11.4.2 See the answer to exercise 3.21 (Chapter II). 

11.4.3 By exercise II.3.15, if x is a limit point of the set 5 in Xj then 9(x) is a limit point of the set 0(5) C X 2 . Suppose 
there is an element in 0(5) which is a limit point, namely that there exists a sequence {(0(s))„} -> s e 0(5). Then given 
6 > 0 there is an N such that for n > N, 

d l (9- l (9(s) n ), 0- l (0(s))) < (\/ci)d 2 (e(s) n , 9(s)) < 6 

Letting 0 _1 (0(s) M ) = s n we see that 9(s) is the image of a limit point in 5. Since 5 is closed, the image of s is in 9(S), thus 
0(5) contains its limit points, and is therefore closed. 

11.4.4 Given x e X, by definition B(x, e) = {yinX : d(x, y) < e] and hence is a subset of X for any 6. Hence X is open. 
II.4.6 Let 5 be a bounded subset of (fR N , Euclidean). By definition, there is a point a e S and a number R, such that 
d(s, a) < RVs 6 5. Explicitly, 



N 

^T(s n - a n ) 2 < R 

M»=i 

The terms in the sum are all non-negative, it therefore follows that (s n - a n ) 2 < R 2 or that \s n - a„ \ < R, for each entry. 
This means that 5 is contained in the box [a x - R,a x + R] x [a 2 - R,a 2 + R] x ■ • ■ x [a N - R,a N + R]. To provide a 
finite e-net for 5, we subdivide each interval into M intervals such that N/M < 6. Then the product of the endpoints of 
each interval is a point in a finite 6-net of this Af-cube, its intersection with 5 is also finite and each point of 5 is less than 
6 away from one of the above corners. Hence 5 is totally bounded. To show that R N is complete, it suffices to note that R 
is complete (exercise II.3.2*) and that a sequence is Cauchy or convergent if and only if it is Cauchy or convergent on each 
of its coordinates. Finally, since (R N is complete, and 5 is totally bounded if it is bounded, it follows that if 5 is closed and 
bounded it is compact. 
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11.4.7 Let (X, d) be a metric space, / : X — >■ X be continuous, and A be a compact non-empty subset of X. It follows 
immediately that if a 6 A, then f(a) 6 /(A) by definition, so /(A) is non-empty. Let {b n } be a sequence of points in 
/(A). Then each b n is the image of at least one point in A. Let {a n } be any sequence in A such that /(«„) = b n for each 
n. Because A is compact, there is a subsequence {a ni } C {«„} which converges to some point a 6 A. The sequence {/(««,)} 
is a subsequence of {£>„}, we must show that it converges to a point in /(A). In fact, because / is continuous, given € > 0, 
there is a <5 > 0 such that if d(a n ., a) < 8 then d (f(a n .), f(a) < €. Since this subsequence converges in A, given <5 > 0 there 
is an N such that i > N implies d(a„ r a) < 8 and therefore that d(f(a ni ), f(a)) < e. This subsequence therefore converges 
in /(A), and letting b nj = f(a n .) we have found a convergent subsequence for the sequence {b n } C /(A). Hence /(A) is a 
compact non-empty subset of X. 

11.4.8 Let 5 C (X l5 di) be open. Then it follows that by example 2.4.5, X] \ 5 is closed. By exercise II.4.3, h(Xi \ 5) 
is closed in X 2 . Let t e h(Xi \ S). Then it follows that since h is invertible, h~ x {t) g 5, hence t e X 2 \ h{S). Hence 
h(X x \ 5) = X 2 \ h(S). Hence X 2 \ h(S) is closed which means that h(S) is open in X 2 . Note that the proof depends on 
the invertibility of h. It does not hold even for arbitrary continuous functions. 

11.4.10 Given x e (0, 1) we have 0 < x < 1 so choose € — min(x, 1 — x). Then the set B(x, e) = {y e X : d(x, y) < e} is 
a subset of (0, 1). Hence (0, 1) is open in X. Let x e {2}. This is a singleton set, so x = 2. Choose e < 1. Then the set 
B(x, €) = {y 6 X : d(x, y) < 1} contains only the point 2 and d(x, 2) — 0 <€. Hence {2} is open in X. The two sets are 
both closed: (0, 1) = X \ {2} is the complement of an open set, by 2.4.5 it is closed. By the same token {2} = X \ (0, 1) so 
it is the complement of an open set and is closed. {2} is compact, since given any sequence in {2} all entries are 2, hence 
the sequence converges, and has a convergent subsequence (any subsequence). (0, 1) is not compact, choose the sequence 
{x n — \/n). This converges to 0 ^ (0, 1), so any subsequence must likewise converge to 0. Hence it has no convergent 
subsequence. 

11.4.11 Let 5 C (X, d). By definition, its boundary 35 is the set of points s such that any open ball B(s, e) contains points 
in 5 and in X \ 5. But 5 = X \ (X \ 5), since it is the set of points which are in X which are not in Xsp \ 5. Consequently 
if any open ball B(s, e) around a point s contains points in both sets it is in the boundary 3(X \ 5, and the two boundaries 
are the same. X is open (exercise II.4.4) hence 0 is closed, and contains it's boundary. Since this implies the boundary of X 
is contained in the empty set, 3X = 0. 

11.4.12 Let (X\,d\) and (X 2 , d 2 ) be equivalent metric spaces and let h : X] ->■ X 2 be the metric equivalence. Let 5 c Xi, 
and let s 6 35. If h(s) g dh(S) then there is an open ball B(h(s), e) which is either a subset of h(S) or of X 2 \ h(S). If 
it is a subset of h(S), then it is the image of an open set in 5 (see problem II.4.8), and by definition there is an open ball 
around s in 5, which contradicts the statement s 6 35. If it is a subset of X 2 \h(S), then there are no points in B(h(s), e) 
which are the image of points in 5. Hence B(h(s), e) is the image of an open set containing s which is contained in X] \ 5. 
Then there is an open ball within this open set, which is centered at s, and contains no points in 5. Again this contradicts 
5 6 35. Consequently, since this is true for all s in 35, h (35) C dh(S). If we do this again for the transformation h~ x with 
the boundary of h(S) we get that the inverse image of the boundary of h(S) is contained in 35. Then the two boundaries are 
images of each other, and h(dS) = dh(S). 

11.4.13 5 C (IR, Euclidean) is the set of rational points in the real line. We want to show that the boundary of 5 is R itself. Let 
x 6 [R, and let € > 0 be given. Suppose a, b are two integers such that b — a + 1 and a < x < b. Now let N be chosen such 
that 1 /2 N < €. Since the numbers a + k/2 N for k <2 N subdivide the interval [a, b] into 2 N equal parts of width 1 /2 N < € 
it follows that any interval which is greater than € in width contains at least one of them. But B(x, e) = [x — e, x + e] is of 
width 2e, hence there is a rational number in B(x, e). The number s/2 is not a rational number: Let p/q = s/2 and suppose 
the fraction p/q is in lowest terms. Squaring both sides yields p 2 = 2q 2 implying that p is even. Hence p 2 is divisible by 4. 
Hence q is even, a contradiction. If c is irrational, so is pi icq) for any rational p/q. If not, by simple algebra we may write 
c in fractional form. Now take all the numbers in [a, b] of the form a + k/(*/22 N . Once again these are evenly spaced and 
any interval of width less than e contains at least one of them. Hence there is a number in B(x, e) which is in R \ 5. Hence 
x 6 35. Since x was arbitrary, 35 = R. 

11.4.14 The only point in C for which every e-ball contains points in C and points in C \ C = {oo} is oo. Hence the boundary 

of C in € is {oo}. 
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11.4.15 Let 5 C X be closed. Suppose that there is a point in X such that for every e > 0 B(x, e) contains points in 5 and 
points in X \ S. Define a sequence {x n } by x n e B(x, l/n), x n ^ x. If this cannot be done, then 95 = 0 and the statement is 
true. Then the above sequence makes x a limit point of 5 and hence x e 5. Hence 35 C 5. 

11.4.16 Suppose 5 is open. Let s e 95. Then for every € > 0 there is a point x e B(s, e) such that x $ 5. Then there is no 
e > 0 such that B(s, e) C 5. Since 5 is open s g S. Hence 35 n 5 = 0. 

11.4.17 By definition 5° = 5 \ 95. Since 5 0 95 = 0, 5 \ 95 = 5. Hence 5° = 5. Conversely, suppose 5° = 5 Then 95 = 0. 
Consequently for any s e 5 there is an e > 0 for which B(s, e) contains no points in X \ 5. Then B(s, e) C 5, and 5 is open. 

11.4.18 Since 5 is closed, by exercise II.4.16, 95 C 5. By definition 5° = 5 \ 95. We have 5 \ 95 U 95 = 5. 

11.4.19 By exercise II.4.12, being the boundary of a set is invariant ujader metric equivalence. Thus if s e 5°, then s e S 
and s £95. Since h(s) e ft(5), and h(s) e dh(S) = h(dS) implies that s G 95 which contradicts hypothesis, h(s) e (ft(5) \ 
dh(S)) which means h(s) e (ft(5))°. 

11.4.20 X \ 95 is 5 \ 95 U (X \ 5) \ 95. Since 95 = d(Xsp \ 5) (by exercise II.4.11) X \ 95 = (X \ 5)° U 5°, which are 
disjoint sets, and by II.4.17, are open. Note that one or the other or both may be empty, however. For the examples, (a) the 
boundary of the set 5 = {(*, y) e R 2 : x 2 + y 2 < 1} is the set 95 = {(*, y) e R 2 : x 2 + y 2 = 1}, since every 6-ball around 
a point in this set contains (x, y) e 5 and as well (x, y) such that x 2 + y 2 > 1. This is a circle of unit radius and divides 
the plane into the open disk interior, and the plane with a circular hole of radius one in it. (b) S - R 2 the whole space, so 
35 = 0. The two sets R 2 and 0 are both open as shown in II.4. 1 and II.4.4. 

11.4.21 By problem II.4.20, X \ 95 is the union of two disjoint open sets. The union of open sets is open, hence 95 is closed. 

11.4.22 We start this problem with an elementary fact about compact sets. Let C be a compact set. Then if A C C is a closed 
subset, let C n be an open cover for A. We can extend this cover to an open cover of C, by adding e -balls in C \ A (which is 
open) we can form an open cover for C. There is therefore a finite subcover [C u ■ ■ ■ C N }. Since by our choice, all of the C„ 
which cover A are from the original open cover, and they are a subset of this finite subcover, there is a finite subcover for A. 
It follows that any closed subset of a compact set is compact. 

We now turn to the problem at hand. 5 is a subset of a compact metric space. By problem II.4.21, 95 is closed, and also 
a subset of the same compact metric space. Therefore 95 is compact. The assertion is best proven by proving the following 
statement which is true for arbitrary subsets 5 (closed, open, otherwise). Let 5 be a subset of a metric space (X, d). Then 
A C 5 is open in the space (5, d) if and only if we can write A = 5 n O for some open subset of X. If A is open in (5, d) 
then for each a e A and each € > 0 

B s (a, €) = {yeS: d(a, y) <€} = {y eX: d(a, y) < «?} n 5 = B x (a, «?) n 5 

so A = 5 n O where O is the union of such balls in X. Conversely, if A = 5 n O for some open set O e X then given a e A, 
there is an € > 0 such that B\{a,€) c O, and 

B s (a, €) = {yeS: d(x, y) < <?} = {y e X : d(x, y) < <?} n 5 = B x (a, <?) n 5 

The latter intersection is in A = S n O, so A is open. 

The assertion that the interior of the land is open in Z despite the fact that it includes points in the "border" of Z, is true 
by the above statement. The fact that it appears to include border points comes about because these points are points in 
9Z c X, whereas when Z is itself the metric space, the border of the whole space is the empty set. 

Answers to Chapter II, section 5 

II.5.1 The properties of connectedness preserved by metric equivalence: (i) Connectedness. Let (X,,^) and (X 2 ,<i2) be 
equivalent metric spaces and h : X, X 2 be the invertible function giving this equivalence. It was proven last section 
(problem II.4.8) that if 5 is an open set, then h(S) is open, and (problem II.4.3) if 5 is a closed set, then h(S) is closed. 
Because h~ ] also establishes an equivalence, the same is true about closed and open sets in X 2 under h~ x . Let X, be 
connected. Suppose 5 C X 2 is both closed and open. Then ft" 1 (5) is closed because 5 is closed, and open because 5 is 
open. Since X] is connected, ft" 1 (5) is either X] or 0. In the first case, because ft is onto, 5 = X 2 . In the second 5 = 0. 
Hence X 2 is connected. 
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(ii) By studying the definition of metric equivalence, we see that given e > 0 there is a 8 = (l/c 2 )e such that d 2 (h(x), 
h(y)) < e whenever d\(x, y) < 8. Consequently h is a continuous function. The composition of continuous functions is 
continuous. Suppose Xj is pathwise connected. Given two points x, y e X 2 because h is onto, there are a pair of points 
u,veX\ such that h(u) = x, h(v) = y. Because Xi is pathwise connected, there exists a continuous function / : [0, 1] — >■ 
Xi taking f(0) = u, f (\) = v. But then there is a continuous function h(f(y)) taking a point y e [0, 1] to X 2 such that 
h(f(0)) = h(u) = x and h(f(\)) = h(v) = y. Consequently, X 2 is pathwise connected. 

(iii) Suppose Xi is disconnected. Then there exist a set A ^ Xi and A ^ 0 which is both open and closed in X[. As in the 
proof of i), since this set is open, h(A) is open in X 2 , since the set is non-empty, h(A) is non-empty, and since A^X X and 
h is onto, h(A) ^ X 2 . Hence X 2 is also disconnected. 

(iv) Suppose Xi is pathwise disconnected. Then there exist points x, y eXi such that there is no continuous function 
mapping [0, 1] to Xi with f(0) — x, f(l) = y. Suppose there were a continuous function g : [0, 1] — ► X 2 such that g(0) = 
h{x) , g(\) = h{y) . Then, because h~ x is continuous, the function h~ l (g(w)) taking the interval to Xi would take /i _1 (g(0) = 
x, h' l (g( 1 )) = >', a contradiction. Hence X 2 is pathwise disconnected. 

(v) Suppose Xi is simply connected, and, because h is continuous and onto, let f\, f 2 : [0, 1] — ► X 2 be two paths 
connecting points h(x), h{y) in X 2 . Then there exist paths /?~'(/i([0, 1])), h~ l (f 2 ([0, 1]) connecting x, y e X { . There is 
then a continuous deformation g (s, t) transforming /i~'(/i([0, 1])) to h~\f 2 ([0, 1])). Then h(g([0, 1] x [0, 1])) X 2 is a 
continuous function, and 

h(g(s,0)) = h(h-\Ms))) = Ms), 
h(g(s, l)) = h(h- l (f 2 (s))) = f 2 (s), 

while 

h(g(0, t))=h(h-\h(x)) = h(x), and 
h(g(l,t)) = h(h-\h(y))) = h(y), 

hence X 2 is simply connected. 

(vi) Finally, assume the converse, that X { is multiply connected. Then there exist two points x, y e X { and two paths 
fi, h •' [0, 1] — »■ Xi with no continuous deformation between them. We again transfer everything to X 2 . The points 
h(x), h{y) in X 2 have the two paths /i(/i([0, 1])), h(f 2 ([0, 1])) connecting them. If there were a continuous deformation 
g : [0, 1] x [0, 1] X 2 deforming one path to the other, then h '(g([0, 1] x [0, 1])) — > Xi would provide a deformation in 
Xi. Hence there is no such deformation, and X 2 is multiply connected. 

11.5.2 The metric space (^Euclidean) is simply connected: Choose x, y and two functions from [0, 1] — >■ ■ such that 
/o(0) = /i(0) = x and / 0 (1) = = y. Let g in the definition be given by g(s, t) = t ■ f 0 (s) + (1 - t) ■ fi(s), where 
a(x, y) — {ax, ay). This satisfies the four demands of the definition, we must only check that for all values of (s, t) it 
remains within the square. It would be maximized in, say, the x coordinate by setting / 0 (s) = fi(s) = (l,y) in which case 
for any t the function would equal t + I — t = (l,y). Since this is clearly in the square, there is no problem there. The other 
cases are identical. Hence ■ is simply connected. 

11.5.3 To show that X = ((0, 1) U {2}, Euclidean) is disconnected, we examine the subset {2}. This set is closed since points 
are closed in metric spaces. It is also open: The ball B e (2) c {2} for e = 1/2 since by definition this ball is the set of all 
points in X which are closer than 1 /2 to the point 2 of which there is only the point itself. Consequently, for every point in 
the set {2} c X, there is an open ball around that point in the subset, hence the subset is open. Since this set is both open and 
closed, and not empty or equal to X, the set is disconnected. 

11.5.4 Given the codespace metric we have used, for the space S on the N symbols 0, . . . , (N — 1), there is a metric 
equivalence to the subset of [0, 1] written in base N + 1 of real numbers with no N + 1-ary digits equal to N. The 
equivalence is generated by the mapping x\x 2 . . . i-> .x\x 2 . . .. The map has the same distance formula in both cases, and is 
clearly 1-1 and onto. The set of such real numbers is a totally disconnected subset of ([0, 1], Euclidean): Given any closed 
subset of the real line, either that subset is a single point, or an interval (possibly [a, oo)). If not, we may write the subset as 
the union of two closed disjoint subsets, and there is a distance between them say c. Then for any point in each subinterval, 
an e -ball around that point of radius c/3 contains only other points from the same subinterval, hence each subinterval is 
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open as well as closed and the set is disconnected. Since the set given above contains no intervals of with numbers with the 
digit N it is clearly not an interval, and it clearly contains more than one point. Hence codespace is disconnected. To see that 
it is totally disconnected, suppose there were an interval enclosed in the equivalent subspace of [0, 1]. Then if a = .x\x 2 ■ ■ ■ 
and b = .yiy 2 ■ ■ . suppose they agree to x, < y,. Then the point .x y x 2 ... (z, + 1 = /V) ... is in the interval [a, b] and not in 
the set. Hence there are no intervals in the set, and it is totally disconnected, by our metric equivalence, codespace is totally 
disconnected. 

11.5.5 Put the annulus in E 2 centered at the origin, take x, y on either side of the origin, and take f Q to be a path which goes 
around the upper half of the annulus, /, a path which goes around the lower half. Suppose the function g(s, t) exists as in 
the definition. Then for each point f 0 (s), s #Jt, y we have fo(s)\ y > 0 and the corresponding value for is negative. 
Holding s fixed the intermediate value theorem says that there is a value of t such that g(s,t)\ y = 0. Since we may take the 
annulus as thin as we please, this says that every point must pass through either x or y as it is deformed from one path to the 
other. If some points pass through x and some through y there is a value of t such that g(s, t ) is not connected, hence is not 
a path. Hence the annulus is multiply connected. 

11.5.6 Choose the set of nested subsets of R 2 given by 5, = 5, where B t ■ = {(x, y) : -1 < x < 1, y < /}. These sets 
are nested, they are all connected, and the intersection is the plane minus a strip around the y-axis 2 units wide, which 
is disconnected. 

II.5.8 The body space is multiply connected, it has a hole corresponding to the entire gastro-intestinal tract, if not others 
(e.g. pierced ears). 

Answers to Chapter II, section 6 

11.6.1 If x and y are elements of H(X) then by definition they are both non-empty and both compact. Their union is 
therefore non-empty. To see that it is also compact, let C be a covering of x U y by open sets. Then there is a subset of 
C which covers x, say C x and another C y which covers y. Because x is compact, there is a finite subset of C x which covers 
x, likewise for y. The collection of sets formed by taking all the sets in the finite subcover of x and all of those in the finite 
subcover y forms a subcover of x U y, and since its cardinality is the sum of two finite integers, is also finite. Hence C admits 
a finite subcover of x U y which is therefore compact. Since x U y is compact and non-empty, it is in H(X). To see that the 
intersection of two elements in H(X) need not be in H(X), take x and y disjoint. Then x D y is empty and therefore not in 
H(X). 

11.6.2 The elements of H(X) are compact non-empty subsets of X, while a compact non-empty subset of X is only a single 
point in H(X). An infinite collection of compact non-empty subsets of X is a valid subset of H(X), while it's union may or 
may not be a compact non-empty subset of X. 

11.6.3 Let B, C e H(X) and B C C. Let x e X. Then 

d(x, C) = mm(d(x, y)) 

yeC 

= min(min(d(x, y)), min (d(x,y))) 

yeB ye(C\B) 

< min(d(x, y)) = d(x, B) 

yeB 

11.6.4 The distance from (1, 1) to the closed disk of radius \ centered at (i,0) is given by the distance to the point on 
the boundary of the disk closest to (1,1), since this point is not in the disk. The closest point on the boundary is on 
a radius of the disk, by the triangle inequality. This means the distance is the distance from (1, 1) to (|,0) minus the 
distance along this chord which is inside the disk, namely \ . The whole distance is y/\ + (1 - 1/2) 2 = V5/4 so the distance 
d(x, B) = (V5- l)/2. 

11.6.5 In this case, the closed disk of radius \ looks like a square with corners at (0, 0), (0, 1), (1/2, 1/2), (1/2, -1, 2) 
since this is the set of points equidistant from (1/2, 0) at a distance 1/2. In the Manhattan metric, the distance to any point 
on the right upper face of this square is 1/2 + 1/2 = 1. Since the distance to any other face is larger, d(x, B) = 1. 

11.6.6 The elements of the set B are less than 3 only if n is odd. The sequence {^} increases monotonically towards zero, 
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consequently it has its least value at 1/2. The least value in B then is the value x x = 3 - 1/2 or 5/2. The distance to the 
point x = 1/2 is then d(x, B) = 2. 

II.6.7 It is easiest to provide a counterexample. Let B c A be a proper subset, such that there is an x e A with d(x, B) = 1. 
(For instance two disks centered at the origin one with radius 1 one with radius 2). Then d(A, B) > 1. Since B c A the 
minimum distance from any point in B to A is to the point itself, namely 0. Hence d(B, A) = 0. The two are not equal, so 
this is not a metric on Ti(X). 

11.6.11 Let A, B e H(X), A^B. Suppose d(A, B) = 0. Then for every x e A,d(x, B) = 0, since we take the maximum 
over these to form d(A, B). This can only occur if min yeB (d(x, y)) = 0, for every point xeA, which means that for each 
one, there is a y e B with y = x. In other words A C B. By the same argument, if d(B, A) is also 0, then B c A. If A c B 
and B C A then A = B. This is not the case so one or the other distance cannot be 0. The converse of the above is that if 
A C B, then for each point in A, 

d(x,B) = mm(d(x,y))=d(x,x)=0 

yeB 

so that the maximum over these is 0. Hence if A C B then d(A, B) — 0. 

11.6.12 Using the solution to problem II.6.3, for each x e A we have d(x, C) < d(x, B). Suppose d(A, B) < d(A,C). 
Then there is a point xeA such that d(x, C) > d(y, B) for all y e A. This is a contradiction, choosing y = x. Hence 
d(A,C)<d(A,B). 

II.6.14 We write out the expressions on either side of the inequality. 

d(A, B) — max(mind(.x, y)) 

xeA yeB 

On the other side, we have 

d(A, C) + d(C, B) - max(min(d(;c, z))) + max(mm(d(w, y))) 

xeA zeC weC yeB 

Since the maximum over w e C is greater than or equal to the value if w is replaced by z, we can write 

d(A, C) + d(C, B) > max(min(d(;t, z)) + min(d(z, y))) 

xeA zeC yeB 

For any choice of x e A and yeB the sum is greater than the distance d(x, y) by the triangle inequality, including the case 
where z has been chosen to minimize d(x, z) and where y has been chosen to minimize d(z, y). Hence for the choice of y 
the sum of the minima above is greater than d(x, y) for that choice of y and hence greater than the distance d(x, y) which 
is the minimum chosen to minimize d(x, y) directly. We may then write 

d(A, C) + d(C, B) > max(min(rf(jr, y)) = d(A, B) 

xeA yeB 

For the other case, whether or not d(A, B) < d(C, A) + d(C, B), we use the case C C B C A. Then the right hand side is 
identically zero, while if A ^ B, the left hand side is not. Hence the expression is not true in general. 
II.6.16 We make extensive use of II.6. 12. To this end, we have 

h(A U B, C U D) = d(A U B, C U D) V d(C U D, A U B). 

lih{A U B , C U D) — d(A D B , C D D) then either h(A U B, C U D) = d(A, C U D) or h(A U B, C U D) = d(B, C U D). 
Choose the former, then by II.6.12, 

h(A U B, C U D) < d(A, C) < h(A, C) < h(A, C) v h(B, D). 

If we choose the latter, 

h(A U B, C U D) < d(B, D) < h(B, D) < h(A, C) v h(B, D) 
If h(AU B,CU D) = d(CU D, A U B) then either h(A Ufi,CUD)= d(C, A U B) or h(A U B, C U D) = d(D, A U B). 
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Choose the former, then by II.6.12, 

h(A Ufi,CU/)) < d(C, A) < h(A, C) < h(A, C) v h(B, D). 

If we choose the latter, 

h(A Ufi,CUD)< d(D, B) < h(B, D) < h(A, C) v h(B, D). 
Since this is all the possibilities, the result follows. 

Answers to Chapter II, section 7 

11.7.3 The reasonable assumptions we could make are that each image is a set of points characterized by spatial coordinates 
and color coordinates. The spatial coordinates for a photograph are in two dimensions and form a filled square, they are thus 
compact. If we assume that the colors are a closed and totally bounded region, say grey scales from 1-100, then the product 
space of pictures is compact. We assume some continuity in time, this is backed up by physical assumptions, the tree will 
probably not do something totally disconnected from its motion in the interval before. Finally, we assume that the tree waves 
at finite speed. This implies that the change between photographs in picture space will decrease as the time interval shortens. 
This should yield a Cauchy sequence given that the distance between two photographs is the Hausdorff metric, applied to 
successive pictures as sets of points with (x,y,color) coordinates, and, say a Euclidean metric on this 3-space. A looks like a 
photograph of the tree at 1 second. 

11.7.4 In the space of non-empty compact subsets of the Sierpinski triangle, an example of an infinite set is (among others) 
the set of all singleton sets {x} with x e A. A Cauchy sequence in this set is given by successive righthand corners of 
triangles with lefthand (lower) corners at the lefthand lower comer of A. This sequence converges to the lefthand lower 
corner of A, the convergence implies that it is Cauchy. 

II. 7.6 If X is complete, then so is H(X). Since this is the case if X is compact, it remains to show that H(X) is totally 
bounded. Let e be given, and let {y } , . .., y k } be an e-net for X. The singleton sets {yi}, . . . , {y k } are elements of H(X) since 
such sets are compact in a metric space. Let A e H(X). The distance /i({y,}, A) is given by 

d( y! ,A)vd(A,y,) 

where by definition, the first distance is the distance from v, to the closest point in A, and the second is given by the 
maximum over points in A of the shortest distance from one of those points to v, that is the maximum distance of a point to 
y, so it is clear that this set of singleton sets does not in itself form an e-net of H(X). However, since the collection is finite, 
its power set is also finite (the collection of all possible sets made from the points >,. Take that union of singletons such that 
the distance from each of the singletons to A is less than € and call this set Y . This certainly exists, since each element of A 
is closer than e to some v, . Then the first element of the above formula is less than e. So is the second. For every element of 
A the distance to one of the elements in {v,} is less than e consequently d(a, Y) < eVa e A. Hence H{X) is totally bounded 
and hence compact. 

Answers to Chapter III, section 1 

III. l.l For all x e X we may write 

mcopies 

r o fix) = 7ifi~f(f n (x)) ■ ■ ■)) 

mcopies ncopies 

= 7aT~?(7(7(~7 u) •••))•■ -m 

m+ncopies 
= f m+n { X ). 

III.1.2 Let / : R -> R with fix) - 2x. Let f{x x ) = f(x 2 ). Then 2jc, = 2x 2 which implies that jc, = x 2 so / is one-to-one. 
Let jeK. Then for any y, 1 /2y e R, so that /(l/2y) = 2(l/2>') = y. Then / is onto. Since / is both one-to-one and onto, 
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it is invertible. Each iteration multiplies the result of the previous iteration by 2. Hence we have 

/"(*) = 2(2(2- ..(2*) •••)) = 2 n x. 



Similarly, 



111 1 
f-M = - 2 (- 2 (- 2 -( r ) -» = 2" 



Thus for any integer, we have f n (x) = 2 n x. 

III.1.3 Let / : [0, 1] — >• [0, 1] with /(*) = 1/2*. / is one-to-one since 1/2* i = 1/2*2 -*i =*2- But / is not onto, since 
f(x) — 1 * = 2 £ [0, 1]. Hence / is not invertible. 

IH.1.4 Let / : [0, 1] -» [0, 1] with /(*) == 4*(1 - *). Taking the first derivative, we have /'(*) =4-8*, which has a 
zero at * = 1/2 which is a maximum. The value of / there is 1. The minimum value must be at one or the other or both 
endpoints. The function takes the value 0 at both endpoints. Hence the function is onto. It is not one-to-one however, for 
example /(1/4) = /(3/4) = 3/4. 

III. 1.5 The first thing to show is that f(x)eCifxeC. We demonstrate this by noting that if * e C then * is the endpoint 
of some closed interval of the form either [(2"3 m )/(3*), (2"3 m + l)/(3*)] or [(3" - l)/(3*), (3")/(3*)]. Hence / raises the 
power of the denominator by 1 and is still of this form. So let /(*) e C. f is one-to-one since *i/3 = * 2 /3 x\ = * 2 . It 
is not onto, since 1 e C and */3 = 1 =»■ * = 3 g C. To find another transformation, note that if * e C then 1 — * e C. So 
/(*) = (1 — *)/3 is another such transformation. 

III. 1.6 Let / : IR 2 — >• R 2 with /(*i, x 2 ) — (2x x , x\ + *i). / is not invertible because it is not one-to-one. Let y = /(*) = 
/(*i, * 2 ). Then y = /(*i, — x 2 ) as well, since * 2 appears in the expression for / only in squared form. 
III. 1.8 An affine transformation applied to an interval takes it to another interval, possibly reflecting it (transposing the 
endpoints in interval notation), hence 

[a + b, 2a + b] if a > 0, and 



f([l,2])=a[\,2] + b= . r „ , L1 
J \[2a + b, a + b] ifa<0. 

To map into the interval we must have 

l<tf + 6<2, \ <2a + b <2 

The new length of the interval will be a, so that 0 < a < 1, with a = 0 mapping the interval to a single point. 
If f,g: [1,2]^ [1, 2] then /([l, 2]) C [1,2], hence 

go/([l,2])Cg([l,2])C[l,2] 

/og([l,2])c/([l,2])C[l,2] 

so that the compositions also map [1,2] into itself. 
We now want the conditions such that 

fog(X)Ugof(X) = X 

Let /(*) —ax + b, and g(x) = cx + d. We make the simplifying assumption that / o g(X) maps the interval to the lower 
end of itself, and g o /(X) to the upper end, and the two overlap across the middle at at least one point. We write out the 
compositions explicitly: 

/ o g(x) = a(cx + d) + b = acx + ad + b; 

g o /(*) = c(ax + b) + d — acx + be + d. 

The contraction of the interval is the same in each case, and is equal to ac. In order for the two maps to overlap, then, the 
length of each interval must be at least 1/2. We therefore have 1/2 < \ac\ < 1. There are two cases: If ac > 0 then fog 
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must map the interval to [1, 1 + ac) and g o f must map it to [2 - ac, 2]. We have 

ac + ad + b = 1 
2ac + be + d = 2 

ad + b—\ - ac 
be + d = 2 - lac 

If ac < 0 then / o g must take 1 h> 1 - ac and 2 h> 1. Similarly « o / takes 2 h> 2 + ac and 1 h> 2. This gives 

2ac + ad + b = 1 
ac + be + d = 1 

ad + b-\ - lac 
bc + d = l-ac 

Given two values for a, c satisfying 1/2 < \ac\ < 1, these equations allow calculation of b, d such that the above condition 
is satisfied 

III.1.10 I n = rdo). Consequently, we have /(/„) = /(/"(/„)) = /" +1 (/o) = W We are given that / = U~ 0 / n is a finite 
connected interval, hence 

/(u- 0 / n ) = ur =0 / n+ i = u~ = / \ /o 

Then the length I = lcngth(/ 0 ) + length(/(/)). The length of /(/) is the amount that it was contracted, hence is equal to 
at This is equal to the length of / \ 7 0 = I - b. This gives al = I - b or I = b/{\ - a). This yields, 

£*•«" 



n=l 



(1 - a) 



III.l.ll Let /, g : IR -+ IR be polynomial transformations. Then if 

/(*) = a 0 + a x x + ■ ■ ■ + a N x N ; gU) = b 0 + b x + ■ ■ ■ + b M x M 

we have 

/ o g(x) = a 0 + a x g{x) + a 2 (g(x)) 2 + ■■■+ a N (g(x)) N 

If we expand g(x) in the last term in the expression, the term with the higest power of x is a N (b M x") N . Smco by definition, 
a N b M ? 0, a N b» * 0 so this is a polynomial transformation of degree M ■ N. Using this, the degree of / (x) is N . 
Ill 1 12 For n > 1 a polynomial function / : R - R of degree n is generally not invertible. A quick example is^any 
polynomial having a, = 0 for all odd f. More precisely, suppose that fix) is invertible. Then there is exactly one number b 
such that 

a 0 + a\b + • • • a N b N = 0 
because the function must be one-to-one. This polynomial must then factor into the two terms 

(x - b)(c 0 + c\x + ■ ■ ■ + c n ^\x n " x ) = oo + • ■ • + a N x N 
The polynomial c 0 + c x x + ■ ■ ■ + either has no zeroes or it has only one *x = b. Assume not. Then by adding or 

subtracting from c 0 , we can force it to have a zero, thus creating a new function with two roots. Such a function is then not 
one-to-one and therefore not invertible. 

III.1.13 For large enough |,"| » \x^\ » • ■ - So for large \x\, fix) looks like l^.^^^mT^^ 
since \x1 - x" 2 \ > \ Xl -# 2 |. If / is an interval {x : \x - a\ < b} for fixed a,b e R, then if M > 0, tifiD)/fH) > M for 
b > p. Let / = [a - b, a + b]. Construct the polynomials and subtract. 

III.1.15 Parabolae opening up or down will work. For example fix) = a(x - l) 2 where a < 2. 
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IH.1.16 The function is continuous. fx(0) = 0 for all k. It is maximized by setting the derivative equal to zero, as follows: 

f x (x) = 2kx — kx 2 = k(2x — x 2 ) 
f[(x)=k(2-2x) 

so that fl(x) — 0 when x — 1. This is a maximum, since the second derivative is negative, and has the value / A (l) = k. 
Hence if k e [0, 2] then f x (x) : [0, 2] [0, 2]. The fold occurs at (1, k). 

HI. 1.17 To be a fold point of f(x), x must be a local maximum or minimum. If f(x) is differentiable, the condition for 
this is f'(x) = 0 and f"(x) ^ 0. Polynomial transformations are differentiable. If f"(x) — 0 we have an inflection for some 
functions, (for example f{x) = jc 3 ) so no fold exists at x. 

IH.1.21 To show that the Mobius transformation is invertible, we show that it is one-to-one and onto, as follows: 

one-to-one: 
. Let f{x\) = f(x 2 ). Then we have 

ax\ + b ax2 + b 
cx\ + d CX2 + d 

which yields 

acX]X 2 + bcx2 + adx\ + bd = acx\X2 + bcx\ + adx2 + bd 
or, bcx 2 + adx\ = bcx\ + adx 2 . That is 

(ad — bc)x\ = (ad — bc)x2 

Since the 'determinant' ad — be is assumed to be non-zero, X\ — x 2 and the function is one-to-one. 
onto: 

Let v e we need to find x such that 

ax + b 
cx + d 

We have 

cxy + dy = ax + b 

cyx — ax — b — dy 

(cy — a)x = b — dy 

-dy + b 

x — 

cy — a 

Since ad — be ^0 we have (— a)(— d) — be ^ 0 so this is another Mobius transformation, and / is onto. 
IH.1.22 We want to show that if f u f 2 are Mobius transformations then f x o f 2 is. Writing 

a x x+b x f _ a 2 x + b 2 
j x — — — ; j2 — — ~ 
C\X + d\ c 2 x + d 2 

we make the necessary substitutions: 

a ^72~xTr 2 + 1 _ a\(ci2X + b 2 ) + b\(c 2 x + d 2 ) 

1 C2X+d2 

_ (a x a 2 + b x c 2 )x + (a x b 2 + b\d 2 ) 
(c\a 2 + d\c 2 )x + (b 2 c x + d\d 2 ) 
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which has the form of a Mobius transformation. We need to check: 

(a x a 2 + b x c 2 )(b 2 c x + d x d 2 ) - (a x b 2 + b x d 2 )(c x a 2 + d x c 2 ) 
— (a x a 2 b 2 c x + a x a 2 d x d 2 + b x c 2 b 2 c x + b x c 2 d x d 2 ) 

- (a x b 2 c x a 2 + a x b 2 d x c 2 + b x d 2 c x a 2 + b x d 2 d x c 2 ) 
= (a x a 2 d x d 2 + b x c x b 2 c 2 ) - (a x d x b 2 c 2 + a 2 d 2 b x c x ) 
= b 2 c 2 (b x c x - a x d x ) + a 2 d 2 (a x d x - b x c x ) 
= (a 2 d 2 - b 2 c 2 )(a x d x -b x c x )^0 
because f x , f 2 are Mobius transformations by assumption. 

111.1.23 The action of the transformation f(z) = l/z on the Riemann sphere is to flip it over. It takes 1 1, -1 -1 so 
there is no rotation, and 0 i-> oo, oo h* 0 which inverts the two poles. 

111.1.24 When /(oo) = oo then a/c -*■ oo, since a < oo we have c = 0. Conversly, if c = 0 then /(oo) = oo by definition. 
Hence if oo i-> oo then the transformation looks like 

ax + b a b 

= — = S* + 5 

which is an affine transformation. 

111.1.25 / : R -► R takes 1 h> 2, 2 0 and 0 h* oo. So we have d = 0 and the function looks like 

ax + b 

fix) = 

cx 

1 h» 2 implies (a + b)/c = 2. 2 i-+ 0 implies 2a + b = 0 or b = -2a. This yields -a/c = 2 so that /(oo) = 2. We can 
only determine the transformation up to a constant (that is, multiplying top and bottom by the same constant yields another 
identical transformation) so set c = 1. This gives a = -2, b = 4 so the Mobius transformation 

-2x +4 

f(x) = 

x 

will do it. 

Answers to Chapter III, section 2 

III.2.1 Let the affine transformation be denoted, as usual, by 

<(;)•(- 

Since (0, 0) is mapped' to (4, 5), we have e = 4, and / = 5. From the relation (0, 1) ^ (-1, 2) we have 

(?)-(: »(?)♦(;) 

or, 

which gives = -5 and c? = -3. Finally, using (1, 0) h> (3, 0), we have 

0-C 3(J) + G) 

or, 4 

C)-0 + G) 
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which gives a = — 1 and c = —5. The function then is 



III. 2. 2 The affine transformation will be invertible if the matrix A is invertible, since if y — Ax + t, then A l (y — t) yields 
x again if A~ x exists. Let A and A~ x be denoted by 



Then we have, by definition, 



or, 



Solving the first two for e, f, 



Solving the second two for g, h, 



Putting these together, we get 



*-■-(; 0 

(e f\(a b\ (\ 0 
\g h)\c d) \0 1 

ea + fc = 1 
eb + fd = 0 
ga + he = 0 
gb + hd=\. 



-fd 

e = 

b 

— fad + fbc = b 



-b d 
f — 777 '■> e = 



det A det A 

—he 
^ a 
-hbc + had = a 



~ C ;h 



det A det A 



(' « ) 



det A 

Consequently, A~ x exists if and only if det A ^ 0. If it does, then if x 2 ) h> (ji, ^2) we have 



\x 2 ) det A \-c a )\y 2 -f) 
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III.2.3 Let/,,/2 



and 



Then we have 



with 



< O-t 5)(;W 



_faa' + bc' ab' + bd'\/x\ fae' 
~'\ca' + dc' cb' + dd')\y) \ce' 



+ bf' + e 
+ df' + f 



Letting 



= ad + be 
= ab' + bd' 
— cd + dc 
= cb' + dd' 
= ae' + bf + e 
= ce' + df' + f 



then, 



which is affine, as desired. 
III.2.4 Let 



+ 



-CO 



Then we have 



af + bh 
cf + dh 



The products subtracted to form the determinant det(Afl) are given by 

(ae + bg)(cf + dh) =aecf + aedh + bgcf + bgdh 
(af + bh)(ce + dg) = aecf + af dg + bhee + bgdh 



Subtracting these yields 



det(Afl) = aedh + bgcf - afdg - bhee 
= ad(eh - fg) - bc(eh - fg) 
= (ad - bc)(eh - fg) 
= det A • det B 



as desired. 
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III.2.5 The easiest way to visualize this is to break the desired transformation into two transformations, one which takes the 
triangle to the line between (0, 0) and (1, 1) and one which takes this line to the line between (2, 2) and (1, 1). The second 
transformation is an inversion and translation: 



which gives e — f = 2 and 



0-C SGM;) 



which gives a + b-2=landc + d-2=\, which by letting b = c = 0 gives a = - 1 and d = - 1 . The first transformation 
takes the triangle to the line between (0, 0) and (1, 1). (0, 0) <r+ (0, 0) so that e = f = 0. We have 



e have 

(o 1 m<D-(-\ :!)G)+G) 



so that b = d= \, and 



so that a = c = I. Multiplying the two together, we have 



To put this in scaling ratios and rotation angles, let 

r, + r 2 = V2 

Then the needed rotation angles are 



cos 9\ = sin 6 1 = cos O2 = sin 9 2 — 



-1 



which gives d\ = 6 2 = ^ . 
III.2.6 We start with the hint. Let 



Since is a translation term, and translation does not affect the area of any transformed polygon, without loss of 

generality e = f = 0. As well, it does not change anything to assume that one vertex of the triangle is at the origin. A 
triangle is then determined by specifying two points (*,, x 2 ) and (y u y 2 ) such that (y^yi) # a ■ (x u x 2 ) (which makes them 
non-collinear), and neither is the origin (which makes less than three distinct vertices, hence no area). First, show that the 
area of the parallelogram in figure 2.6 is given by 

area = \xiy 2 -x 2 y\\ 

(this is the length of the vector in a vector (cross) product). To see this, rotate the parallelogram until (y u y 2 ) becomes 
(y[, 0). Then if the new coordinates for (x u x 2 ) are given by (x[,x' 2 ) the area of the parallelogram is the base times the 
height, or y[x' 2 . The rotation is —9 with 



y/yf + yl ' yfyJ+yl 
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Figure 2.6. Getting 
the area of a triangle from 
a parallelogram 

y1,y2 



, x1 > x2 



°'° X 



We have: 



COS0 


sin 


— sin 6 


cos 


cos 6 


sin 


— sin 6 


cos 



x x cos# + x 2 sin6 
x 2 cos 6 — x\ sinf 



:) 

KyAJ y, cos 6> + y 2 sin 6>\ 
yi) \y 2 cos6 -y x sin6 ) 



so that 



y[x' 2 = (y x cosO + y 2 sin0)(x 2 cos6 - x x sinO) 

(y 2 x + yl)y\ x 2 = y^y 2 - ywxyi + - x x y 2 y] 
= y 2 x (x 2 y x - x x y 2 ) + ylixiyx - x x y 2 ) 
yielding \y\x\ | = |x,y 2 - y,x 2 \ as desired. The area of the desired triangle is then \ \x x y 2 - y 2 x x \. Apply the transformation 

to each of the two vectors. Then the new area is derived by 

x\ = ax x + bx 2 ; y\ = ay x + by 2 
x' 2 = cx x + dx 2 ; y 2 = cy x + dy 2 



yielding 



^\acx x y x + adx x y 2 + bcx 2 y x + bdx 2 y 2 - acx x y x - bcx x y 2 - adx 2 y x - bcx 2 y 2 \ 

= - Had - bc)x x y 2 - (ad - bc)x 2 y x \ 
2 

= ^\det A\\x x y 2 - x 2 y x \ 

A polygon (and many other surfaces) can be broken into triangles on each of which this formula applies, which finishes the 
proof. To see that det A ^ 0 corresponds to reflection, write 



-((-.* ?)C S))~*C » 
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Note: The vector/cross/wedge product used to do the proof this way can be generalized to be applicable on other types of 
surfaces and hypersurfaces. The generalized version preserves the form | det A | -("area") to such surfaces and volumes. 
III.2.7 Let w(x) — Ax + t be an R 2 similitude. Then w has one of the forms: 

(xi \ _(r cos 9 — r sin 9 \ / x { \ ( e \ 
x 2 ) \rsm9 rcosO ) \x 2 ) \f ) 

(x\\ (r cos 9 r sin 0 \ ( Xl \ + ( e \ 

©--ft; «/)te)-(;)-*G)-(;) 



The first can be rewritten as 



By making the substitution 0 = — 9 the transforaiation 

-06 

-( 

■( 

so with this substitution, the second form for a similitude becomes 



cos t> — sin 6 
sin 9 cos 9 



cos 9 


— sin 


— sin 9 


— cos 


COS 0 


— sin 0 


sin 0 


COS 0 



feH*fe)+(;) 



which has the proper form. 

HI.2.9 There are many possible ways to write a similitude w(x) — Ax + 1 which will take the Sierpinski triangle to a subset 
of itself, and we give the simplest one. To take the triangle to the subtriangle with corner (0, 0) we note that by taking 

(0, 0) h+ (0, 0) 

(l,0)h+(i,0) 
1 1 1 



we have completely determined w. The first transformation implies that 

'-(;)-(» 



For the other two, 



so that a = \ and c 



so that b = 0 and d=\. Then w looks like 



C 2) (;)-«-( 
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III.2.10 The simplest way to show this decomposition is to multiply it out. 



(J »( 
-( 
-( 



COS0 

sin0 



r\ cos 0 
r 2 sin 0 

r\rj, cos 0 
r 2 r^ sin 0 



- sin 0 \ ( 


r 3 


0> 


cos0 / \ 






—r\ sin 0 \ 


(5 


0 


r 2 cos 0 J 






—r\r 4 sinl 


") 




r 2 r 4 cos 0 







This is a general affine transformation because the four terms r,r 3 , r,r 4 , r 2 r 3 , and r 2 r 4 are //warty independent. Thus means 

that none of these can be written as a linear combination (sum with constant coefficients) of the other three hence any 

arbitrary value can be written this way. It is more important to understand what is going on geometrically which is done 

by looking at the action of this transformation on the coordinate axes. The transformation by r 3 , r 4 rescales the coordinates 

that is stretches or shrinks each coordinate and possibly reflects one or both. The rotation matrix rotates f 

by some angle 0, leaving the rescaled coordinates off axis. The action of the second rescahng has two functions. It further 

stretches or shrinks the rotated axes, but because they are no longer lined up with horizontal or vertical it also changes 

the an g le between them. This has the effect of skewing the new map of the plane. We are left with a 

translates, rotates, rescales, reflects and skews. This is the most general affine transformation, and requires all three parts of 

the decomposition to accomplish it. 

III.2.12 We multiply the two matrices to get the identity matrix: 



( a b \ { delA deM \ = ( 
\ C d /\d*A d^A / VlitX 

■( 

-( 



ad— be ba—ab 
detA 
—bc+ad 
det A 



det A 
det A 

0 

1 0 
0 1 



0 

det A 
detA 



which proves the statement. 
III.2.13 Let 



-C 



By exercise III.2. 12 we have 



= — ( 

det B \ 

( ^_ ■ 

__ ( detB detB J 
\ detB detB / 



d 

-c 

d -b 



Then we have 



440 Selected Answers 



BAB 



0 :)(; s) 

\ / \ « / \ det B det B ' 

\c d)\ sidi£ ">>-$i> ) 

aed-afc+bgd-bhc a 2 -aeb+bah- gb 2 

det B det B 

ced-fc 2 +f>d 2 -dhc caf-ceb+dah-dgb 

det B det B 



which gives 



tr {BAB ') = -l—(aed - afc + bgd - bhc + caf - ceb + dah - dgb) 



det B 
1 

det B 
det B 



(ade - bee + adh - bch) 
(e + h) =tr A 



det B 

The other equation is somewhat simpler: 

det(BAB- 1 ) = det 5 det Adetfl -1 = det B det B~ l det A = det(BB~ l ) det A 
Now 55"' = / the identity matrix, and det / = 1, so 

det(BAB~ l ) = det(BB~ l ) det A = det A. 
III.2.14 We simplify the problem by noting that if q is any scalar, 

\Hqx)\ _ / (aqxi + bqx 2 ) 2 + (cqx { + J^x 2 ) 2 \ Z 
I?* I V <7 2 (x 2 + x 2 2 ) J 



-( 



g 2 (axi + frx 2 ) 2 + q 2 (cxi + Jx 2 ) 2 \ 2 
<7 2 (x 2 + x 2 2 ) J 



consequently, it suffices to maximize over x such that |x| = 1, that is, over the unit circle. For the Euclidean metric then, we 
let x, = cos 0 and x 2 = sin 0. Then recognizing that the derivative will be zero only if the derivative of the quantity inside 
the square root sign is zero, we maximize 

n d 

0= — ((acos0 + bsin0) 2 + (ccosO + ds'inO) 2 ) 

= 2(a cos 0 + b sin 0)(-a sin 0 + b cos 0) 

+ 2(c cos 0 + d sin 0){-c sin 0 + d cos 0) 
= -a 2 sin 0 cos 0 + ab cos 2 0 - ab sin 2 0 + b 2 sin 0 cos 6> 

- c 2 sin 6> cos 0 + cd cos 2 6> - cd sin 2 0 + d 2 sin 6> cos 6> 

fe 2 - a 2 d 2 + c 2 

= — ~ — sin 20 + ab cos 20 + — - — sin 20 + cd cos 20 

^ z 

0 = 2(ab + cd) cos 20 + ((b 2 + d 2 ) - {a 2 + c 2 )) sin 20, or, 
((a 2 + c 2 ) - (6 2 + J 2 )) 
2(ab + cd) 



tan 26> = 
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TWslastequation has (in genera.) four solutions. Two are minima, two are maxima, and the norm exists. The unit circle has 

lr x r I — 1 and is a diamond centered at the origin, with vertices on the axes at (±1 , 0) and (0, ±1). By looting ai me 
^l-r™.h, the transformed ^ 

From exercise II.2.9, this transformation is a composition of a coordinate r esc ai ^ a ™ , inequality will 

images is the maximum. We have 

A(xi, x 2 ) = (axi + bx 2 , cjci + dx 2 ) 
at the vertices we have \A\ =max((a + c), (b + d)). 

Answers to Chapter III, section 3 

thus have _ 

az + b adz + bd 
f^ = ~ d —-—\ d J- 

/(;)-&' «/) (;)+(;) 

Le, t = x + ,>.» = e + if and a = „ + ia>. We have az = («,* - W > + J> + «*> «e can write using real 

coordinates and matrix notation as 

Letting det A = r> (det A is positive since we presumed no reflection) and e = cm-C/r) we have : our « 
S out det A the determinant is 1 and with a = and * = -c, the new determent must read cos 0 + s,n 0 - 1, as 
check that the new values match up. i„ fin itv to a 

^SSaegSSSaSSSSBL 



have 

1 _ j 

z ~ Iz 



/(Z) = - = TZT2 



u ^ • o^r „f Ap ia«t exnression is 1 and each point does indeed move to it's conjugate. We 
is in fact enough to prove the assertion, but we will perform the explicit map. 
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(iii) We need a coordinate system to represent points on the sphere. Since the map used to generate the Riemann sphere is 
to map each point to the point on the surface where a line connecting the north pole to the plane point intersects the sphere, 
we choose the angle 0 between this line and the diameter of the sphere connecting the poles (interior angle). This gives us 
a "latitude" coordinate, for a longitude, we take the polar coordinate angle of the plane point, with respect to the positive 
real line. Expressing the point z in the plane as re w , this amounts to choosing d. We need the coordinate transformation for 
0. The line from the north pole to the point z forms the hypotenuse of a right triangle, the side adjacent is the radius of the 
sphere to the north pole, and the side opposite is the plane vector to z. Consequently, we have 

r — \z\ = tan0 

We may thus express the coordinate transformation as 

z = Z\ + izi = tan 0 cos 0 + i tan 0 sin 6 

or more simply as 

z =tan0e ,e . 

How do these coordinates change when we rotate the sphere? As with the equator, the 0 coordinate becomes the negative 
angle with the positive real line, that is 6 i->- -6>. For 0 we need some plane geometry. First, we note that if we have a 
diameter of a circle in the plane, and a point not on that diameter, the triangle formed by the chords between the endpoints 
and the third point is a right triangle. To see this, add the radius to the third point to divide the triangle into two isoceles 
triangles, and sum up the angles at the center of the circle. Second, we take the same planar circle, and draw the lines 
connecting a point on the perpendicular bisector of the diameter to the endpoints. If we now draw a line from one of the 
ends of the diameter to the point at which the line from the other end to the bisector intersects the circle, the triangle so 
formed is similar to the two triangles formed by the lines previously drawn to the bisector. Consequently, given that the 
tangent of the angle from the north pole to a point on the sphere is tan 0, the tangent of the angle formed by the south pole 
is cot 0. 

When we rotate the Riemann sphere around the real line, we have exactly this situation, namely that the angle 0 of the 
original point on the sphere is now the angle measured from the south pole. Consequently, the tangent of the new angle to 
the north pole is cot 0. We now look at the action of f(z) on z = tan 4>e w . We have 

/(*) = " = r~TTe =cot0e i( - e) 
z tan0e ,y 

so the spherical coordinates have taken 6^-6 and tan 0 cot 0 as desired. 
III.3.3 If f(z) is not a similitude then, by problem 1, c # 0. We may thus let 

a 



f = - 
d 

8 = - 
c. 

This writes f(z) in the proper form as we now check: 

/ a 



c 

ad — be 



e + 



z + g c z + 



d 



az+ja (2^) 
cz + d cz + d 
az + b 
cz + d 
f(z) 



J 
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as desired. 

III.3.6 We wish to show that a Mobius transformation / : C -> C is always invertible. We divide the problem into it s two 
cases: Let 

az + b 
f(z) = ——7 
cz + a 

(i) Suppose c = 0. Then the Mobius transformation can be rewritten as f(z) = a'z + b'. Let w = f(z). Then the inverse is 
given by 

w — b 

z = 



a 



(ii) Suppose c # 0. Then the transformation can be rewritten in the form 

f(z) = e + 

z + g 

Let w = f(z). Then the inverse may be written 

, / 
z = g + 



III.3.7 See answer to exercise 1.22 in Chapter III. 

Ill 3 8 We wish to transform the real line to the unit circle in C. This means taking z = Zi+iz 2 such that z 2 = 0 to \z\ = 1 • 
Looking at the Riemann sphere, this can be accomplished by rotating the north and south poles to the equator. Specifically 
we take oo / and 0 -i. Since we need one more point to define the transformation, we may do this by insisting that 
we rotate about the axis between 1 and -1 by taking 1 \-> 1. We have 



From this we get 



a 

OO h->- I =>■ - = I 

c 
b 
a 

a + b 1 

1 h+ 1 = 1 

c + d 



a — ic 
b = -id 
a + b 



c + d 
a+b=c+d 
ic — id = c + d 



Choose a = 1. Then we have 



f(z) = 



n-2i + i 2 \ 

d = ic 

z + i iz - 1 



-iz + l z + i 
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III.3.9 We do out the first few terms of /" (z) to see what we are dealing with: 

f°(z) = z 
f\z) 



1 



fHz) 



f(z) 



\+z 

1 



1 + 



This object is known as a continued fraction and is usually written as 

1 



[a 0 ,ai, ...,a k ].= a 0 + 



a x + 



a2+- 



There is a whole body of mathematics to deal with these interesting creatures, they can be used to represent numbers, 
functions, traces of matrices, and other things. We need only the following relations: Let p 0 = a 0 , p x = a x ao + 1, p k = 
a-kPk-x + Pk-2 and q$ = 1, q x = a x , q k = a k q k _\ + q k -2- Then the following relation holds: 

Pk 

[a 0 , a u . . . , a k ] = — 
Qk 

For our function, we have 

ru) = [o, i,i,..., i,z] 

with n one's implied. If we look at the growth of the terms /?, they represent (for terms before the n th term, the terms in a 
Fibonacci series. That is, let F 0 = 1, Fj = 1, F n — F„_i + F„_ 2 . The first few terms look like 

1, 1,2,3,5,8, 13, ... 

Since po = 0, p x = 1, p 2 = 1, p^ = 2, and so forth, we may write p k — F k _ u for k < n. Similarly, we may write q k — F k for 
k <n. Then f(z) may be written, using the above relations, 

r(z)= zF„_ 2 +F„_ 3 



For the inverse transformations, we look at 



Writing out the first few iterations, we have 



zF„_! + F„_ 2 



1 1 

w = =»z = -1 + — 

1 + z w 



w 

r 2 (u>) = -\+ 1 



■1 + 1 

u) 

Again we have the form of a continued fraction, this time 

= [-1,-1,-1,..., -!,«;] 
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Using exactly the same kind of analysis as before, we have 

Po = -1 
Pi =2 

Pi = -3 



<7o = 1 

9i = -1 

q 2 = 2 



which gives 



Pk - TFk+i ; Qk - ±F k 
wF n + F„_i 



wF n -i + F„_ 2 



nij.10 We want the geometrical interpretation of the map 



f(z) = i + 



z - 1 



To get this we first note that by rewriting 



m = <" + 



1 



iz + 1 + 1 iz + 2 



z-i z-i z - 1 

we have /(oo) =a/c = i, /(/) = oo. This suggests we first translate the plane to bring the point j to the origin. For this we 
have 

0(z) = z-i 

e-\z) = z + i 



eofo e~\z) = (i + — -. — : ) - i = - 

V z + i - 1 ) z 



Consequently we can describe the mapping as translating the plane up by one unit, mapping the plane to the Riemann 
sphere, rotating the Riemann sphere about the real line, and mapping back. The net effect is a translation and an inversion. 



Answers to Chapter III, section 4 

ni.4.2 To be correct inverse functions for f(z) we must verify that f(f ~~ l (z)) = z. For points in the upper half plane, we 
have 

f(a(xi,X2) + ib(x u X2)) 

= (a 2 -b 2 -l) + 2iab 



7(1 +xi) 2 + x 2 + 1 + Xl _ 7(1 + X] ) 2 + x 2 - l- Xl _ 2 
2 2 2 



/7(1 +x i ) 2 +x 2 + 1 +x, / 7(1 +x l ) 2 + x 2 -l-x l 
+ 2i,l W - 



i 0 . (l+xtf+xj-il+x,) 2 
= xi +2i, 



= x\+ ix 2 = z 

so that w\ does indeed invert f(z) there. w 2 differs from in the above derivation only in the imaginary part which replaces 
liab with 2i(-a)(-b) and hence also inverts /(z). Finally, for the lower half of the plane, we notice that we are in need 
of a negative sign preceding the imaginary term, since the positive square root is indicated. This is provided by replacing a 
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by —a. This has no effect on the real part, because a appears there only as an a 2 term, completing the test for the inverse of 
f(z). 

IH.4.3 Let f(z) = z 2 + 1. The critical points are those values of z such that f'(z) = 0, that is, the first derivative vanishes. 
f'{z) = 2z = 0 when z = 0 so this is the critical point of f(z). The critical values of f(z) are by definition the range values 
at which f'(z) vanishes, that is f(z) such that z is a critical point. This occurs at f(0) = 1 so 1 is a critical point for / (z). 

IH.4.5 We determine the local approximate affine transformations for these functions by expanding about zo in a Taylor 
(Laurent) series for f(z), and restricting to the first two coefficients a 0 and a\. 
(a) f(z.) = z 2 and z 0 =l.a 0 = /(l) = l. For the linear coefficient, we have 

/'(I) . 

Letting w(z) be the linear approximation, we have 



(b) f{z) = 1 and zo = 1 + We have 



u>(z) = l+2(z-l) = 2z-l 



1 

ao = f(z 0 ) = 



1 + J 

*' = / ' (Z0) ^ = (IT^ 



Using the same notation as in (a), we have 



w(z) = rT7 + (TT^ (z - (1 + i)) 

1 -z 1 + i 

+ 



\+i (l+O 2 (l+O 2 

2 _ z_ 
1 + i 2i 
i 2 

rZ + 



2 1 + i 

(c) f( z ) = (z - l) 3 and z 0 = 1 - i. We have 

flo = f(zo) = (d - i) ~ l) 3 = (-if = i 
a x = /'(zo) = 3(z 0 - D 2 = 3(-/) 2 = -3 

Putting these together gives 

iw(z) = i+-3(z-(l -0 
= i + 3 - 3 1 - 3z 
= -3z + 3 - 2i 
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IH.5.3 Let Xf be the fixed point of f(x) = Ax + t. Let 9(x) = x — x f . Then d~ l (x') = x' + x f , and we have 

0 od~\x') = A(x' + x f ) + t - x f 
— Ax' + Ax/ + t — Xf 
= Ax' + x f -Xf 
= Ax' = f'{x') 
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as desired. 

m.5.4 Using the coordinate change notation 9 : X X we have 9{x) - 2x - 1, which yields 6>~'(x') = (x' + l)/2. We get 
the transformed /' in the new space by 

f'(x') =9ofo 0"V) : X' -> X' 

which we get from 



giving us, 

2 



o / o fT V) =2(( ) + 1 ) - 1 = l -(x' - l) 2 + 1 



III.5.5 We first find the fixed points of the Mobius transformation 



Multiplying through we get 



which factors into 



f , <, (z+2) 



z + 2 = 4z - V 



z 2 - 3z + 2 = 0 



(z- l)(z-2) = 0 



There are two fixed points, at z = 1 and z = 2. 
Make the coordinate transformation which takes 1 i-> 0 and 2 k>- oo. The transformation 

ew = ^ 

z - 2 

will work. We need the inverse of this: 

z- 1 

w = 

z — 2 

(z-2)w=z-\ 
zw — 2w = z — 1 
wz — z = 2w — 1 
2w - 1 



z = 0-'(u,) = 



to - 1 

We make the composition f'(w) = 9 o / o 9~ l (w) as follows: 

' + 2 2w-l+2w-2 Aw -3 



so that, 

4io-3-2io + 3 2 
J 4io-3 2 4u> - 3 - 4u> + 6 3 



J {W) ~ 4 - ~ Aw - 4 - 2w + 1 ~ 2w - 3 



2u;-3 
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This map shrinks the sphere by a factor of 2/3. The whole process can be written by expanding the real line from 1 to 2 to 
the the whole positive real line, then shrinking the resultant Riemann sphere, then reducing the positive real line to the line 
between 1 and 2 again. The net result is shrinking the sphere in a non-uniform manner. 

III.5.6 Let W(x) = Ax + t which we express as usual. The fixed point of this transformation is 



which we rewrite as 

x fl =ax fl + bx f2 + e ; x fl = cx /, + dx fl + / 
We solve this as a system of linear equations: 

0 = (a - \)x fl + bx f2 + e 
0 = cx fl +-(</- \)x f2 + f 



which gives 



yielding 



A nearly identical derivation gives 



0 = c(a — + bcx/ 2 + ce 
0 = c(a- \)x fl + (a-\)(d- l)x f2 + e 
0 = ((a - - 1) - bc)x f2 + (a - 1)/ 
0 = (det(A - I))x f2 + (a-l)f - ce 

^ ce-(a-\)f 
Xfl det(A - /) 

bf-(d-\)e 
det(A - I) 



Notice that these equations do indeed become invalid if det(A — /) = 0, when the "fixed point" tends to infinity. To 
transform coordinates so that Xf is the origin, choose the transformation 6(x) = x — Xf. The inverse of this, letting w = 0(jc) 
is just x — w + Xf. The whole coordinate transformation is then 

6 o f o d~\w) = (A(w + x f ) + t)-x f 
= Aw + Axf + t — Xf 
= Aw + (Axf + t) — Xf 
= Aw + Xf — Xf 
= Aw 

So that the effect, in the transformed system is just the action of the linear transformation Ax. 
IH.5.7 By exercise III.2.13 we know that 

det D = XiX 2 = detA 

and 

tr D = Xi + X 2 = tr A 
Then by multiplying (X — X\)(X — X 2 ) each A, is a solution to the equation 

X 2 -(X l +X 2 )X + (X l X 2 ) = 0 
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tr AX + det A = 0 



Now 

det 



= eh - eX - hX + X 2 - fg 
= X 2 - (e + h)X + eh- fg 
= X 2 - tr AX + det A 

as desired. The above equation is called the eigenvalue equation for A, the numbers X\, X 2 are called eigenvalues, and have 
the property that there exist lines in U 2 such that for any v on those lines 

Av = Xv. 

The lines are called the eigenvectors of the matrix A. 

m.5.8 To make the transformation h (z) = \ for analyzing the behavior at the point at infinity, we first note that h~ 1 (w) = ± 
has the same functional form. Composing, we have 

, 1 w 

6 o / o 0" 1 = — = = 

7( 1 ) + 1 1 + w 

v ID ' 

The point at 0 is of interest, and is now a fixed point. Locally we may expand 

w 



a 0 + a x w 



1 + w 

where, a 0 = 0 and a\ is given by the Taylor expansion 

d ( w \ ^ (1 + w) 



a\ — 



7 

49 



</u; VJ + W (1 + w) 2 

so that near zero, this looks like f'(w) = (\/l)w which is a contraction. 
III.5.9 We have 

f ll (x)=x 2 -n; g k (x) = Xx(\ - x) 

and we want to reexpress / as / is a parabola opening up, centered at the j-axis with vertex at -/x, g is a parabola 
centered at x = 1/2 opening down, with zeroes at 0, 1. Consequently, we need to reflect /, and stretch it so that it's zeroes 
are one unit apart, and shift it to the right by 1/2. First, the transformation 6(x) = ~x will reflect the parabola as follows: 

e o / o e~\x) = e o /(-*) = -/(-*) 

which for any value of /x is 

-f(-x) = -((-x) 2 - fi) = -x 2 + /x 

a parabola pointing down with vertex /x centered on the j-axis. The transformation which will move the vertex to x = 1/2 
is given by 6(x) = x — 1/2. Combining these yields 6(x) = —x - 1/2, or, 

6>o/o6r'(x) = -x 2 +x- 1/4 + /X- 1/2 = -x 2 + x + (/x - 3/4) 

We now need to adjust this so as to produce a multiplication by X and remove the constant term. We use the transformation 
0(x) —x/X. Once again qomposing, we now have 

-x 1 
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so that 

e~\x) - -xx + - 

2 



We have 



eof fl oe- i (x') = eof fl (-xx' + ^ 

= 0(k 2 (^ -x' + x' 2 )-^) 

,2 ^ 1 M 

= + ^- ( 4 + 2 + I ) 

This is identical to the expression for g^C*') except for the constant term, we now find /x(k) by setting this term to zero and 
solving for ix. 

„ * 1 M 
0= - + - + - 
4 2a 

-/x _ a + 2 

~r ~ 4 

-1 2 -1 



We have 



1 jc —1,-1 
6(x) = - ; fi(k) = — A 2 + — A 

2 A ^ 4 2 



III.5.10 We want the behavior of g(x) — x 2 — 1/2 near the fixed points. We have 

2 1 
Xf = x f - - 

2 1 

x f~ x f~2 = ° 



1 ± V3 
X f = —2- 

and we show the behavior around these points using two methods. 
Analysis by moving the fixed points to the origin: 
Let 

6(x)=x -x fx —x- ( 1 

1 + 

e~ 1 (x)=x + x fl =x + ( — - — ) 



Then we have 



go6> 1 ) = + (1 + v3)jc H - 

4 2 

= y 2 (l+V3)y+i±^ 
gV) = 6ogo e~\x') = x' 2 + (1 + V3)jc' 
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which has a fixed point at the origin. Very close to the origin, since x' 2 x', we have 

g'(x') « (1 + V3)x' 
which is an expansion. For the other fixed point, take 

1 - \/3 

0(x)-x -x f2 -x - ( — - — ) 
1 - 

0-\x) =x +x f2 =x + ( — ~- ) 

which gives us 



8 ° < 



-w '\ '2 , /i A\ i , 1 — 2\/3 + 3 1 
(x ) =x + (1 - V3)x H 



X' 2 (1-V3)X'+ 1 ~^ 



2 

g'(x')=eogo9- l (x')=x' 2 + (\-V3)x' 
Again looking at behavior near the origin, we have 

g'(x') « (1 - V3)x' 

Since the coefficient is less than zero, there is a reflection, and since |1 - V3| < 1, it is composed with a contraction. 
Analysis by expanding g(x) in a Taylor series: 
The derivatives needed to expand g(x) in Taylor series up to the linear term, at the fixed points are 

g (Q \x)=x 2 -\ 
g (l \x) = 2x 

At the first fixed point, x f we have 

(0), 1 + V3 

2 ^ 

fli = g (1) (*)|iW3 = 1 +V3 

2 

and the expansion to the linear term looks like 

+ (1 + ^ _ 1W3, = (1 + ^ + (i W5 _ 2 _ ^ 
which is an expansion. For the fixed point x fl we have 

«0 = g V \X)\ - 

2 ^ 

2 

and the expansion to the linear term looks like 

+ (1 _ V3)(x - i^) = (1 - V3)x + (i^ - 2 = V3) 
which is a reflection and contraction as before. In general, it depends on the nature of the function which method is easier. 



452 Selected Answers 



If it can be transformed by inspection, transforming the fixed points to the origin is preferable. In general, the Taylor series 
expansion method is more straight forward and simpler. 

III.5.11 In exercise III.5.7 we showed that the entries of the diagonal matrix D satisfy the eigenvalue equation 

X 2 - tr AX + det A = 0 

Using the quadratic equation, the solutions to this equation are 

_ trA±V(trA) 2 -4 det A 
2 

and the condition stated in the problem is that the discriminant of this equation be greater than zero, in order to have two 
real solutions to the equation. If the discriminant is less than zero, then the solutions are complex conjugates of the form 
X k = x±iy = re ±w . The matrix defines a transformation which contains a rotation by 6 and therefore cannot be put into 
diagonal form, although it can be written as a rescaling and a rotation. 

Turning to the problem of finding the matrix B, we separate out for a moment the normalizing factor of 1/ det B for the 
inverse of B, which will turn out not to matter, and write 



det B ■ BAB' 1 = det B ■ D = det B ■ 

Letting 



ft a 



, ,'aed-afc + bgd-bhc a 2 - aeb + bah - gb 2 
detBBAB' 1 1 



we have 

\ ced - fc 2 + gd 2 - dhc caf - ceb + dah 
which yields 

ade - afc + bdg - bch = det B ■ X { 
adh +afc - bdg - bee = det B ■ X 2 
a 2 f -b 2 g -abe +abh=0 
d 2 g - c 2 f + cde -cdh=0 

The first two equations when subtracted or added, merely return to us the result of exercise III.2.13, that the trace and 
determinant are invariant under this kind of coordinate change. Since only these are dependent on the det B term in front, 
we can safely ignore it in what follows, which is why it was carried separately. Rewriting the last two equations as 

a 2 f - ab(e - h) - b 2 g = 0 
c 2 f - cd(e — h) — d 2 g = 0 
we may solve either one, solving the last one for c, we have 



d(e -h)± Jd\e - h) 2 + 4fd 2 g 
c= 



d (e - h) ± y/(e — h) 2 + 4/g 

r 2 
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Examining the discriminant, we have 

(e - h) 2 + 4fg = e 2 + h 2 - 2eh + Afg = e 2 + h 2 + 2fg - 2det A 

= e 2 + h 2 + 2det A + 2fg — 4det A = e 2 + h 2 + 2eh - 4det A 
= (e + h) 2 - 4det A = (tr A) 2 - 4det A 
which is the discriminant of the eigenvalue equation. Putting this in, we have 

d trA± V(tr A) 2 -4det A d 
c= _ ( v±—L h) = {k ._ h) 

or 

c{-f—)=d ; a(—t—)=d. 
A, — h ki — h 

Since det Z? ^ 0 the columns are independent, so we must choose a different A, for each of the above equations. Only the 
ratios are important here, so we choose a = c = 1 , and have 



(A 1 -/i)(A 2 -/i) /l 



A.i-ft 



In closing, we note that the eigenvectors of A are the solutions to the pair of equations 

e*\ + f x 2 = 
gj:! + /ij: 2 = kx 2 

which appears to yield the two solutions 

(*-g) . g 

These are actually identical, since by the eigenvalue equation 

k — e g 
f ~ k-h 

To solve for the eigenvectors, simply let x\ = 1 and use one of the corresponding values for x 2 . Hence the rows of the matrix 
B found above are precisely the eigenvectors of A. This is true in general, (for N x N matrices the eigenvalue equation will 
have N real roots if it is diagonalizable), and depending on the problem presents another way to find a useful transformation. 
III.5.12 To solve this problem, we make use of the methods developed in exercise III.5.1 1. We have 

'«-G 3)(;;)+0) 

Solving for the fixed point, we may rewrite this as 

'«-(J DU'^H-U) 

so there is a necessary translation by (0, — 1/2) to bring the fixed point to the origin. To diagonalize the matrix we have 

-a 3) 

so that tr A = 4 and det A = — 1 . This gives us the eigenvalue equation 

A 2 - 4A - 1 = 0 
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yielding 

ki = 2 ± V5 

We may now write down immediately the final form of the transformation, we want B and the above translation to produce 

/2 + V5 0 \ 
/U) = (, 0 2-V5J 

which is indeed a coordinate rescaling. A matrix for B is then given in exercise III.5.1 1 by 

*-(! S) 
-G 2) 

or to write out the whole expression for the coordinate transformation 9{x), 

*»-G SKIM") 

111.5.13 Let / be a transformation /: X -> X. We want to show that {/"}~ 0 is a semigroup, which requires showing that 
/" o f m : X -> X is in this set of transformations. But 

f»of m (x) = f n (f m (x)) = f n+m (x) 

so the set of iterates of / is indeed a semigroup. 

111.5.14 Let T : E — >• E be a left shift transformation on code space, i.e. 

X — X\X2X^ . . . 

T(x) = X2X3X4 . . . 

Since each application of T shifts the sequence to the left and drops the initial term, we have 

T"(x) = X n + \X n+ 2X n +T, .... 

The fixed points of T 3 (x) = x 4 x 5 . . . must therefore obey 

x S\ x h x h • ■ ■ = x k x h x ft ■ ■ ■ 

Listing out the first few equalities, we have 

X\ — X4 
X2 — x 5 
X?, — X(, 
X4 = Xj 



which yields 

X2 = 

In other words, any fixed point of T 3 (x) must repeat the first three symbols indefinitely. Explicitly, the fixed points of T\x) 
are 

000, 00T, 010, on, Too, ToT, TTo, TIT. 
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IH.5.15 We need to show that the Mobius transformations on IR form a group under composition. To check closure, we let 

ax + b ex + f 

w ^ = — tz, ; "^to ~ — rr 

cx + a gx + h 

The composition is then 

W\ O Wjix) = -7 

aex + af + bgx + bh 

cex + cf + dgx + dh 
- ( ae + b 8) x + (af + bh) 
~ (ce + dg)x + (cf + dh) 

which has the proper form. We check 

(ae + bg)(cf + dh) - (ce + dg)(af + bh) 

= aecf + adeh + bcgf + bdgh - aecf - bceh — adgf - bdgh 
= (ad — bc)eh — (ad — bc)gf 
= (ad-bc)(eh-gf)^0 

so the Mobius transformations on IR form a semigroup. They are invertible as follows: Let 

ax + b 



We have 



cx + d 

wcx + wd — ax+b 
wcx — ax = b — wd 

—dw + b dw — b 



x = 



cw — a —civ + a 



which has the proper form, with 



da - (-b)(-c) =ad-bc^O 
so there is an inverse for each element, and they form a group. 

111.5.16 This problem is identical in algebraic manipulations to exercise 5.12, with the replacement of a, b, c, d, z,w eC 
instead of in IR. 

111.5.17 Let f(x) = Ax + t, and g(x) — Bx + u be invertible affine transformations on K 2 . Then if w = fix), we have 

w - t - Ax ; A~\w -t)=x = A~ ] w - A~ { t — f"\w). 

Similarly, g~\w) = B^w - B~ l u. 
We have 

/ o g(x) = A(Bx + u) + t = ABx + (Au + t) 

which is an affine transformation. We need to check whether it is invertible both because it needs to be in our set of 
transformations, and to prove group structure. Let 

w - ABx + (Au + t) 
^ w — (Au + t) = ABx 

A~\w - (Au + t)) = Bx 
B- l A~ x (w - (Au + t))=x 
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Rewriting the left hand side, we have 



f~\w) = B- x A' x w - B~ x A-\Au + t) = B~ X A- X w - (B~ x u + B~ x A~ x t), 



which is affine. The inverse matrices B~\ A~ x must exist because by assumption /, g are invertible. Hence the set of 
invertible affine transformations is a group under composition. 

111.5.18 Suppose /(A) C A. This implies /"(A) C A, as follows: It is certainly true for n = 1. Suppose for every 0 < 
i < k, the statement holds. Since by hypothesis /* _, (A) C A, and /(A) C A if A C A, we have /(/* _1 (A)) C A. But 
f(f k ~ x (A)) = /*(A) so the claim is true by induction. To show that this implies that {/"} is a semigroup, recall that 
fof m = f n+m which is in the collection. Hence {/"} is a semigroup of transformations. 

111.5.19 Let w(x) = Ax + t where 



and ac ^ 0. This transformation is invertible with x = A~ x {w - t), as long as A~ x exists. This is the case since det A = 
ac # 0. To show that the set of transformations of this form are a group under composition, we need to show that if B, A are 
of this form then BA is as well, and, if A is of this form then A~ x is as well. 




Let 




and 




Then 




which is also of the proper form. We have 



ae ■ eg = (ac)(eg) # 0 



so the transformation is in the set. Furthermore, 




which is of the correct form, ca = ac, so the inverse is in the set. Hence the set of all such transformations w{x) on IR 2 forms 
a group. 

III.5.20 Given f{z) = Az 2 + Bz + C, we want 9(z) = az + b such that 



9 o f o6- x U') = (z) 2 + C 



Since 



e- x u') = -(z -b) 



a 



we have 




Composing 9(z) with this yields 



9 o f o 9~ x (z) = -z' 2 + (B- 2—)z + ( 
a a 



Ab 2 



- Bb + Ca + b) 



a 
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We must have A /a = 1 and B — 2{bA/a) = 0 for the desired transformation, meaning a = A and b = 5/2. Using these to 
simplify the constant term we have 

B B B 2 

e(z) = Az + -; C = CA + -- — 



Answers to Chapter III, section 6 

II 1.6.1 

(a) sup(— oo, 3) = 3; inf (— oo, 3) = — oo. 

(b) sup(C) = l;inf(C) = 0. 

(c) sup{l, 2, 3, 4, . . .} = +oo; inf{l, 2, 3, 4, . . .} = 1. 

(d) sup(K+) = +oo; inf (R+) = 0. 

111.6.2 Let / : X -> X be a contraction mapping on (X, d), and let so — inf {s e R : s is a contractivity factor for /}. Then 
so is a contractivity factor for /. 

Proof: Clearly 0 < s < 1 since if s is any contractivity factor for /, then 0 < s < 1 so that 0 < s 0 < s < 1. As well, the 
infimum is taken over a non-empty set since there is at least some s which is a contractivity factor for / by definition. 
Consequently, we need only show that s 0 satisfies 

d(f(x), f(y)) < s 0 ■ d(x, y) Vx, y e X. 

Suppose not. Then there is a pair x,y eX such that 

d(f(x),f(y))>s 0 -d(x,y). 

But this says that 

d(f(x),f(y))-s 0 -d(x,y)>0 

and since 

d(f(x),f(y))-d(x,y)<0 

by the intermediate value theorem, there is an s' with 1 > s' > s 0 such that 

d(f(x),f(y))-s'-d(x,y) = 0 

Hence s' is less than all valid contractivity factors, and there is an interval (so, s') which contains no values which are valid 
contractivity factors for /. This is a contradiction of the assumption that so is the infimum of the contractivity factors for /. 

111.6.3 Let / : X -> X and g : X -> X be contractions with contractivity factors s, t respectively. Then the contractivity 
factor of / o g is st. 

Proof: For all x, y e X we have 

d(g(x), g(y)) < t ■ d(x, y)=>s- d(g(x), g(y)) < st ■ d(x, y) 

But we also have 

d(f(g(x)), f(g(y))) < s ■ d(g(x), g(y)) < st ■ d(x, y) 
which by definition makes st a contractivity factor for f(g(x)) = f o g(x) as desired. 

111.6.4 Let w(x) = Ax + t : K 2 -> K 2 with fixed point x f and det(A - /) # 0. Let h{x) = x - x f = x'. Then h~ l {x') = 
x' + Xf. Represent 

*)(;:)+(;) 
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and 



Then we have 



•>=(:;:) 
•-«-(: ;)(;;:;:;)*(;) 

(ax\ + axj x + bx' 2 + bxf 2 + e \ 
cx[ +cx fl + dx' 2 +dx f2 + f ) 



_(ax\+ bx' 2 + x fl \ 
\cx[ + dx' 2 + x/ 2 J 



h o w o h (x 



because x f is a fixed point. Then 

+ bx' 2 4- x /, 
+ dx' 2 + x f2 

+ bx 2 
+ dx 2 

= Ax' = w'(x') 

Inverting this we have 

/i"' O U) O /!(jc) = O U)'(X — Xf) = h~ X {A{x — Xf)) = A(X — Xf) + Xf 

Finally, we check that A(x — x f ) +x f = Ax + t. Since 



'-( 
-( 



we have 



( X h\ = ( aX h + tx f2 + e \ 

\x f2 ) \cx fl +dx f2 + f J 



ax\ + bx 2 — (ax fl + bx f2 ) + (ax fl + bx f2 + e) 
cx x +dx 2 - (cx fl +dx f2 ) + {cx fx +dx f2 + f) 

ax\ + bx 2 + e 
cx\ + dx 2 + f 



(ax\ + bx 2 + e \ 
cxi +dx 2 + f J 



To prove the next claim we have 



and 



Suppose that 



= Ax + 1 

w(x) = w l (x) = A l (x — Xf) + Xf 

w 2 (x) = A(w(x) — Xf) + Xf 

= A((A(x - x f ) + x f ) - x f ) + x f 
= A(A(x - x f )) + x f 
= A 2 (x — Xf) + Xf 

w n (x) = A n (x - x f ) +x f 
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Then we have 

w n+l (x) = A((A"(x -x f ) +x f ) -x f ) +x f 
= A(A"(x-x f ))+x f 
= A n+l (x — Xf) + Xf 

so the result follows by induction on n. 

We now describe the conditions on A under which the map is contractive for the Euclidean metric. We want d(w(x), 
w(y)) < s ■ d(x, y) for some 0 < s < 1. This means, 

d((A(x -Xf)+x f ), (A(y -Xf)+x f ))<s ■ d(x, y) 

or 

{ax x -ax fx +bx 2 -i>x f2 + x fl - ay x +ax fl -by 2 + bx f2 -x fx ) 2 

+ {cx x - cx /j + dx 2 -dx f2 + x f2 - cy x +cx fl - dy 2 + dx Sl - x Sl f 
= (axi - ay x + bx 2 - by 2 ) 2 + (cx x - cy x + dx 2 - dy 2 ) 2 
= (A(x-y) l ) 2 + (A(x-y) 2 f 

Using the notation of exercise III.2. 14 we need 

\A(x -y)\ <s ■ \x -y\ 



\A(x-y)\ 

< sVx, y e X 

\x - y\ 

This is certainly true if we maximize over \x - y\ so we have as a condition that \A\ < s < 1 for w(x) to be contractive. The 

proof is identical with the Manhattan metric substituted and the appropriate changes made. 

III.6.6 Let / : K R be given by f(x) = l/2x + 1/2. Then / is a contraction mapping as follows: 

d(f(x),f{y))=d{ l -x+ X -, X -y+ 1 -) 
= l-+2- ( 2^ + 2 )l 



*2* 2^' 



1 1 
2 
1 

= -U-y| 
< \d(x,y) 

so that s = \ is a contractivity factor. To find the fixed point, we write 

1 1 

Xf= 2 Xf+ 2 
1 1 

1 2 Xf ~2 
orxf-l. Rewriting in the form f(x) = A(x - x f ) + x f , we have 
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With x = 0, f"(x) = 1 - 1/(2"). 
Writing the original, 



If 



/(x) = I(x-l) + l 
f n (x) = ^(x-l) + l 

1 

lim f(x) = lim — (x - 1) + 1 

n->oc n->oc 2" 

= 1 = Xf 



, 11 11111 

^=2 ( 2 X+ 2 )+ 2 = 2 + 4 + 4* 



1 1 " 1 1 

f(x) = -(f-\ x )) + - = V - + — X 

J 2 2 A* 2« 2" 

i=i 



then 



-ui 1 1 1 1 

/" +, = 5(E27 + ^ )+ 2 

1 = 1 
1 = 1 

^ 1 1 1 

= > 1 x H — 

^ 2'' 2" +1 2 

(=2 

— Z— f 91 9"+l 



91 ' 90+1 
1 = 1 z z 



so the formula for f(x) follows by induction. In the limit, 

00 ^ 

X f = £ Tn = 1 



n=l 

independent of x. 

111.6.7 Let (X,d) be compact, and / : X -* X be a contraction with contractivity factor s and fixed point Xf. Then 
/(X)CX,/(X)#X. 

Proof: (X, d) is compact so it is bounded. In particular, since the function g(x) = d(x/, x) is a continuous function from 
(X, d) to R, it has a maximum M, and a point x M gX such that dQt/, x M ) — M. Then for all x e X, 

/(*/)) =</(/(*),*/) 
<^(/(x w ),x/) 
< 5 • Af < M 

hence there is no x such that /Qt) = xm, and /(X) ^ X. 

111.6.8 The set of contraction mappings on (X, d) are a semigroup under composition. To see this, we only need to know 
that if /, g are contractions then so is / o g. By problem 3, if the contractivity factors for /, g are s, t respectively, then so 
is / o g with contractivity factor st. 
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HI.6.9 Let w(x) = Ax + t where 

^_/|cosl20° -\ sin 120° \ 
~ V \ sin 120° \ cos 120° ) 



and t = ^ ^ J. The easiest way to see that w takes the equilateral Sierpinski triangle to a subset of itself, and is a 
contraction is to break the transformation apart. The equilateral Sierpinski triangle is expressed by the following IFS: 

-ft) -a Dft) 

-ft)-tt DftMl) 
-ft) -fi DftMO 

Each transformation shrinks the triangle towards one of its vertices by a factor of 2, the vertices are (0, 0), (1/2, V3/2), and 
(1, 0) respectively. Looking at w we may rewrite it as 



1 /cos 120° -sinl20°\ /xA , /|\ 
2Vsinl20° cosl20° ) \x 2 ) \0 ) 

(\ 0\ /cos 120° -sinl20°\ /*i\ , (\\ 
\0 I7\sinl20° cosl20° / \x 2 ) \0 ) 

(i D*ft> + (0 

where R e is a rotation by 0 = 120°. For the whole attractor A, the map therefore rotates A by 120° about its left lower vertex 
(O), by an angle equal to the exterior angle of an equilateral triangle, depositing it as an equilateral Sierpinski triangle on the 
left side of the y-axis with vertices (-1,0), (0, 0), and (—1/2, V5/2). The next action is a contraction, since it is identical 
to w\, and creates, setwise, a mirror image of W\(A) on the left side of the y-axis. Finally, there is a translation to the right 
by 1/2 bringing the image to a half-sized equilateral triangle with vertices (0, 0), (1/2, 0), and (1/4, V3/4). In other words, 
it lies exactly (setwise) on the image u>i(A). So the map does indeed contract the triangle, and take it to a subset of itself. 
The fixed point of w can be calculated by 




(xA = /icosl20° -isinl20°\ / x,\ /l/2\ 
\x 2 J \\ sin 120° | cos 120° J \x 2 J \0j 

■(? ?)(:)*(¥) 



which yields 
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5 V3 1 n 

-X\ H x 2 =0 

4 4 2 

V3 5 

— x,--x 2 = 0 

V3 

X\ — Xi 

5 



10 
31 

_ 2V3 

X2 ~ ~3T 

HI.6.10 Let E be a code space on the two symbols {0, 1} and f(x) = Ix = lxix 2 . . .. Then using the metric 

d(x,y) = 2^—^i — 

we have 

d(f(x), f(y)) = + —f 1 



= E 

n = l 



1 oo 



3«+i 
\xi - yi 



3 3 ' 
1 

= -d{x,y) 

so that / is a contraction mapping with contractivity factor 1/3 and fixed point I. 

III.6.11 Let (X,d) be a compact metric space, and / : X -> X be a contraction mapping. Then {/"(X} is a Cauchy 
sequence in (H(X), h) and lim^oo /"(X) = {x f } where x/ is the fixed point of f(x). 

Proof: We first verify that {/ (X)} is in fact a sequence in H(X). Since / is a contraction mapping, / is continuous, and 
the continuous image of a compact set is compact. Hence /(X) is compact. By induction, /"(X) is compact. Furthermore, 
since x f e X and f"(Xf) = x f e /"(X), /"(X) is non-empty. Hence, /"(X) e H(X) for all n, so this is indeed a sequence 
in 7^(X). We next verify that the sequence is Cauchy. If so, since X is compact, X is complete, and therefore so is 7i(X). 
Hence if the sequence is Cauchy, it has a limit in 7i(X). Let s be a contractivity factor for f(x), and let e > 0 be given. 
Since s < 1, we may choose N such that for any n > N, s" < 8 where 8 is a number of choice less than 1. Let n,m be 
chosen n < m, and n, m > N. Since m > n, we have f m {X) C /"(X), so that 

h(f m (X),f(X))= max d{x,f m {X)) 

*e/"(X) 

But this must be smaller than the diameter of /"(X) defined by 

diam(/"(X)) = max d(x,y) 

x,yef"(X) 

We have 

diam(/"(X)) <s"diam(X) 
since s is a contractivity factor for /. By choosing N such that s N diam(X) < e, we guarantee that 
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h(f m (X), f n (X)) = max d(x, f m (X)) < s N diam(X) < e 

xef"(X) 

because all the distances considered for the maximum are less than diam(/"(X)). Hence the sequence is Cauchy and has a 
limit in H(X). 

We now show that that limit is x f . We know that x f e lim,,^^ /"(X). We need only show that 

lim diam(/"(X)) = 0 

n—*oo 

But this limit is bounded by 

lim diam(/"(X)) < lim s"diam(X) = 0 

Hence, if y e lim,,^.*, /"(X) then d(y,x f ) = 0, but d is a metric, so this implies y=x f . Hence 

lim f"(X) = {x f } 

n-*oo 

as desired. 

III.6.13 We are asked to show that if A = ^ is strictly positive definite (that is a, b, c, d > 0) then the transformation 
f(9) defined on the first quadrant by 



/cos0\ /cos/(0)\ 
A \sm0j \smf(9)J 



is a contraction mapping, where k is some positive number. Since we may scale k to fit the matrix A, we first check the case 
where det A = 0. This happens if the rows or columns of A are multiples of each other, and we have f{9) = 9 f a constant, 
so in this case the function / converges on the first application. Otherwise, by rescaling X this will work for any strictly 
positive definite A so long as it works for a strictly positive definite A such that det A = 1 . We need to have real eigenvalues, 
so that tr A = a + d > 2. We have a d - be = 1 or be = ad - 1 and bc> 0 so that ad > 1. Suppose ad = 1. Then d = l/a 
and we have a discriminant 



The expression 



has derivative 



a 2 + lad + d 2 - 4 = a 2 + \ - 2 
a 1 



2 1 

a 2 +- 
a 1 



d , 1 2 
— (a 2 +-) = 2a-- 
da a 1 a J 

showing zeroes when a 4 — 1 which since a > 0 occurs when a = 1 . This is a minimum, and yields a discriminant of 0, but 
because the determinant is 1, this would yield be = 0, which it can not be, so that we do indeed have such a k, and one 
choice is greater than zero. 

To show that / is a contraction mapping, we must show that 

1 d(f(9 l ),f(e 2 ))<s-d(0 u 92). 



The sine of an angle is an increasing function in the first quadrant, so writing 
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so that 



sin(/(0,) - f(9 2 )) = sin /«?,) cos f(0 2 ) - cos /«?,) sin f(0 2 ) 

c cos Q\+d sin #i a cos 0 2 + b sin #2 a cos 9\ + b sin 9. c cos 0 2 + d sin #2 
_ ( _ )( - ) - ( - )( - ) 

= ^ (ac cos 0\. cos 0 2 + be cos 0\ sin 0 2 + ad sin #i cos 0 2 + fcd sin 0 2 sin #i ) 

— ^ ( ac cos #i cos #2 + cos sin 0 2 + be sin 0 X cos 0 2 + W sin 0 X sin 0 2 ) 

= ^(ad(sin#i cos 0 2 — cos#i sin# 2 ) — £c(sin#i cos 0 2 — cos#i sin# 2 ) 
det A 

= — jj- (sin cos 6 2 — cos #i sin 6 2 ) 

det A . /n 
= -jj- sin(0, - 0 2 ) 

:, to get a contraction mapping, we need 

sin(/(0,) - f{9 2 )) det A 

= — = 5 < 1 

sin(6>, - 0 2 ) k 1 
To test this, recall that assuming det A = 1 , we have tr A > 2 and 

trA y/(trA) 2 -4 
2 2 

indicating A. > 1. So / is a contraction, implying that {/"(#)} — ► 9/ as desired. The fixed angle is in fact the angle of the 
eigenvector, yielding 

A. — a 
Or = arctan( — - — ) 
b 

Answers to Chapter III, section 7 

III.7.1 We have w { (x) = 1 /3x and w 2 (x) = 1 /3x + 2/3. We compute the contractivity factor for the IFS. 

Ill 1 

d{w x {x), wi(y)) = \-x - -y\ = -\x - y\ = -d(x, y) 

1 

Si = - 
3 

d(w 2 (x), w 2 (y)) = \(^x + ?) - (ly + ?| = I|* - y| = !</(*, y) 

1 

52= 3 

1 1 1 

s =max(s 1 ,s 2 )=max(-, -) = - 

We prove that the attractor of this IFS is the Cantor set by proving by induction that the action of W(B) is to remove the 
middle third of each interval in B. To this end, our hypothesis is: Let [a, b] be a component of B k with fc < n. Then 

[a, a + X -{b - a)] U[b-^(b- a),b] C B k+i 



and 



(a + ^(b-a),b-^(b-a))DB k+1 => 
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We establish that this is true for the first case: Let B 0 = [0, 1]. Then 

1 2 

B\ = W(B 0 ) = wi(fio) U w 2 (B 0 ) = [0, -] U [-, 1] 

Now suppose that [a, b] C B n is a component of B n . Then there is a component [a', b'] C fl„-i whose image is [a, b] under 
some Wi (i = 1, 2). By hypothesis, the component [a', b'] has had its middle third removed in B n . Then [a, b] is replaced in 
fi„ + i by the set 

Wi([a' + \ib'- a')] U [b' -\0>'- a')}) = [a + X -{b - a)] U [b - X -{b - a)] 

since under w t the length b' - a' b - a and clearly the two points a',b'^a,b respectively. Hence this operation is a 
removal of middle thirds, and the induction goes through. Then the attractor of the IFS is the classical Cantor set. 

111.7.2 We have w x (x) = s x x and w 2 (x) = (1 - s{)x + s u Let B 0 = [0, 1]. Then 

fii = W(B 0 ) 

= wd[0., l])Uu> 2 ([0, 1]) 

= [0,J,]U[J,, 1 -Sx +5,] 

= [0,j,]U[j,, 1] = [0, l] = B 0 

Hence B 0 = [0, 1] is the attractor of the IFS. 

111.7.3 Let wi(x) = |x and w 2 {x) = \x + \. Let B 0 = [0, 1]. Then 

fii = W(B 0 ) 

= u>,([0, l])Uu; 2 ([0, 1]) 

= [0,i]U[i,l] 

We have, at least on the first iteration, an operation which removes an interval of length J from an interval, with this removed 
interval being just to the left of the center of the interval. We test this by applying the induction hypothesis as in problem 1. 
Assume that for each k < n, any component interval in B k has a J slice taken out to the left of the halfway mark, specifically, 
[a, b] goes to 

[a,a + ^(b-a)]U[b-^(b-a),b] 

and 

(a + ^(b-a),b-^{b-a))nB k+l =Q 

Then if [a, b] is a component of B n , then it is the image under one of the maps, say of some interval [a', b'] C #„-!• In 
B n this interval has been replaced by 

[a', a'+ l - {b' -a')]U W - \(b' - a'), b'] 

the image of which under Wj is 

[a,a + ^(b-a)]U[b-^(b-a),b] 

which says that that B n+i is derived from B n by taking out the \ slice. A is a Cantor set, with uncountably many points, and 
no intervals. To see this last, if [a, b) were a component interval of A, then since W(A) = A, this operation does not change 
[a, b] but the above proof says it must lose a slice, hence [a, b] C A a = b. 

III.7.5 Let wi(x) =ax + b and w 2 (x) = cx + d as in the statement of the problem. The attractor A of the IFS is compact, 
hence it is bounded, and acheives its bounds. Then there is a closed interval [a,i]cR such that a = min{x : x e A} and 
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b = max{* : * e A). Suppose 

w 1 ([a,b])nw 2 ([a,b])^0 

Then since [a, b] is connected, and the w, are continuous, w\([a, b]) and w 2 ([a, b]) are connected, their intersection is non- 
empty, hence wi([a, b]) U w 2 ([a, b]) is connected. Furthermore, there are points a\, b\ e A C [a, b] whose image in one or 
the other map is a, b respectively. If there were any points in one of the images which were outside the interval [a, b], then 
the image of either a or b under one of the maps would lie outside of [a, b] and therefore not be in A which it is. Hence this 
image, w\([a, b]) U w 2 ([a, b]) is mapped to itself exactly by W and is therefore the attractor A itself. Thus A is connected if 
w\ ([a, b]) n w 2 ([a, b]) # 0. Suppose w x ([a, b]) n w 2 ([a, b]) = 0. Then it follows since A C [a, b] that w x (A) n w 2 (A) = 0, 
as well. Let [a', b'] be the longest connected component of A. Since w x (A) n w 2 { A) — 0, there is an / e { 1 , 2} and an interval 
[a", b"] (connected component of A) such that 

Wi([a" , b"]) = [a' , V] 

Let Si < 1 be a contractivity factor for io,-. Then 

d{a, b') < Si d{a", b") 

which says that either [a", b"] is longer than [a', b'] or the length of each is zero. The first contradicts the statement that 
[a', b'] is the longest connected component of A, consequently a' = b', and the only connected components of A are points. 
Hence A is totally disconnected. 

111.7.6 No. The reason for this is as follows: Because each map must have a contractivity factor of less than one, each map 
must take the proposed attractor A, that is the union of two disjoint intervals, to a subset of either one interval or the other. 
The image of each map will be the union of two disjoint intervals itself. Consequently, let w x take A into the first interval, 
and w 2 take A into the second. This situation must arise (possibly by renumbering the maps. It is possible to generate a 
single connected interval of either of these two images with 103, constructing one of the intervals of A. It is not possible to 
cover the gap in the other, since the image of io 3 must lie entirely in one interval or the other. (Note that the attractor could 
be constructed from four maps). 

111.7.7 Let {K 2 : 10 1, w 2 } be an IFS with 10, given by 

«(;)-G ;)(;)*(!) 
-0-0 1)0 

Let the attractor of the IFS be A. Then (x,y) e A implies that x = y, and x, y e [0, 1]. 
Proof: Let B = {(*, y) € 0& 2 : x = y, x, y e [0, 1]} Then 

-(;)-(£:!) 
-(;)=(!;) 

Since by supposition, x, y < 1, 1/2*, l/2y < 1/2 so adding 1/2 to each remains in [0, 1]. Furthermore, if x = y then 
1/2* = l/2y and 1/2* + 1/2 = \/2y + 1/2 so the image of B is contained in B. The image of B under w\ is {(*, y) e 
B : *, y G [1/2, 1]} exactly. This is because 0 f— > 1 /2 in each coordinate, and 1 h> 1 in each coordinate. The image of B 
under w 2 is {(*, y) e B : x, y e [0, 1/2]} exactly as well. Hence the image W(B) = B, and since the attractor is unique, 
B = A. 

111.7.8 We show that the attractor is the point {1}, together with the images w"(0). That these points are in the attractor of 
the IFS follows from the fact that the two fixed points 0, 1 must be in the attractor, and therefore all forward images of them 
must be as well. Let 

A = {x : x = \ or x = w"(0); n = 0, ... 00} 
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Let* e A. Then ioi(jc) = 0 = u;°(0), if x = 1 then w 2 (x) = 1, and otherwise there is a* such that u;£(0) = iso that w 2 (x) = 
w k+ \0). Hence C A. Furthermore if x = 0 then there is an element of A (for instance 0) such that x = w x (0), if x = 1 

then x = w 2 (l), and otherwise if x = w k 2 (0) then x = w 2 (w k 2 ~\0). Consequently W takes A onto itself as well. Therefore 
W(A) = A. A is a countable sequence of increasing points by definition. We want the geometric series for x n . Writing 
w 2 (x) = 2/3(x - 1) + 1 we have w n 2 (x) = (2/3) n (x - 1) + 1. Substituting x = 0, we have *„ = 1 - (2/3)". To put this in 
the form of a geometric series, 



-(l-fxf)-' 
2 3 



We may thus write the terms x n as the partial sums of the geometric series 



1 00 ? 



Answers to Chapter III, section 8 

There are no answers to section 8, all the exercises are programming experiments. 
Answers to Chapter III, section 9 

III.9.2 We assume that w 0 (B) = C for any B e H(X). We have 

-(;)-C7 o° 75 )(;)+( 0 o 25 ) 

To show that {U 2 : w 0 , w x } we need only establish the contractivity factor for w x . We may rewrite w x using multiplication 
by a scalar as 

-(;)— C) + (°o 25 ) 

d(w { (x), wdy)) = >/(.75jc, + .25 - .75y, - .25) 2 + (J5x 2 - .15y 2 ) 2 
= y/{.15\{x x -y x ) 2 + {x 2 -y 2 ) 2 )) 
= .15y/(x x -y x ) 2 + (x 2 -y 2 ) 2 
= .15d(x,y) 

so the contractivity factor for w x is 0.75, the map is contractive and this is a hyperbolic IFS with condensation. Let A 0 = C. 
We calculate u^(A 0 ) by using the A(x -x f )+x f form of the transformation. First the fixed point. 

x h = .75*/, + .25 => .25*/, = .25 =>x fx = 1 
* /2 = .75* /2 x f2 = 0 

Rewriting u;" as 
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we see that w"(A 0 ) is C shrunk by (0.75)" and moved to 1 - (0.75)". In other words, as the figure shows, the trees get 
smaller, closer together, and closer to 1 . By example 1 , 

00 

A = \JW n (Ao) 

n=0 

with 

A x = C U uj,(C) 

A 2 = C U Wi(C U Wi(C)) = C U iwi(C) U w]{C) 

Suppose that for k <n we have 

k 

A k = \Jw\(C) 

i=0 

then 

A n+ i = C U iyi(A„) 

= CUu>, ^(Juji(C)^ 
«+i 

= Cu\Jw\(C) 
i=i 

«+i 

= |J U ;' 1 (C) 

i=0 

Counting the number of transformations of C for A n produces n + 1 trees as desired. Let C be the trunk of the tree. That is 
w 0 (B) = C for any B e H(X). We define w\ , w 2 so as to form the two branches off the trunk, as follows: 

/x\_/rcos0 -rsin0\ /0\ 
Wl \y)~ \rsinO rcosO ) + \\) 

(x\_(rcosO rsin0\ /0\ 
Wl \y) \— rsinO r cos 6 J \\ J 

In other words, w\ rotates the trunk by 6, shrinks it by r and moves its base to (0, 1). w 2 rotates the trunk by —0, shrinks it 
by r and moves its base to (0, 1). Successive applications of either of these transformations rotates the trunk by ±n0,shrinks 
it by r", and moves the base to the top of the last piece. The figure becomes disconnected if the translation moves the piece 
above the top of the last trunk, it becomes not simply connected if r is close to 1 and 6 is large. 

IH.9.8 Without the 'els' instruction we get effectively the union of W applied to the initial set. By example 1, this is the 
attractor of an IFS with condensation, the condensation set being the initial set chosen. 

Answers to Chapter III, section 10 

HI.10.10 We make reference to the general form of the affine transformation expressed in terms of the fixed point derived 
in exercise III. 6.4. That is, for a contractive affine map, we have 

f{x) = Ax + / = A(x — x f ) +Xf 
f(x) = A n (x-x f )+x f 
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which we regard as the canonical form of a contraction mapping. We have, in this case 

Wi ( z ) = Si + (1 - s,>, = s,(z - a t ) + a, 

Regarding multiplication by a complex number as an U 2 transformation, namely 

SiZ = Si^zi + is ix zi + is i2 Z\ - s i2 zi 
= (s,-,zi - s i2 zi) + i(*/,Z2 + s i2 z\) 

_ ( S h S; 2 

\s i2 J/, J \Z 2 J 

this transformation is equivalent to the transformation 

-G)-te ?)(r:;H:;) 

from which we can read off the fixed point as a h Note that s f is a rotation and scaling, making the whole operation a 
similitude. 

Answers to Chapter III, section 1 1 

III 11 3 In order that the attractor of an IFS be two lines joined at the origin, and variable with a parameter p, we may pick 
three maps as follows: Let w^x, y) shrink the angle towards the origin with no rotation. This creates the angle again, with 
smaller clock hands. To get the hands back to the same length, create two maps which map the whole attractor to the tip 01 
each hand, that is to a line at the same angle as each hand, with a fixed point at the end of the hand. Consequently, choose 
wi to be 

which shrinks the clock face slightly. To collapse the attractor to a line of slope 9, we need 

^ y)= (c d)C) = («+S0 

such that 

cx + dy 

^-=tan0 V x,y 

ax + by 

This is most easily accomplished by starting with the relations 

c d 

- = - = tan 6 
a b 

for which one solution is the matrix 

A _ ( cos ^ cos 0 \ 
— y sin 9 sin 6 J 

This matrix has det A = 0 and generates the proper slope. Now suppose we want the length of the new line to be less than 
1. The clock hands can be any set of lengths, we will take the minute hand to be length 1, and the hour hand to be length 
1 /2. Choosing the minute hand itself, we have 



/cos0\ / 
\sin<9; v 



cos 2 9 + sin 8 cos i 
sin 2 6 + sin 6 cos < 
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so that the right hand side is of length less than two. Add to this the possibility of the whole other hand, which must be less 
than one, and we have to divide this matrix by 3 to assure that it fits within the given hand. So let 



= ( 



|cos# |cos# 
|sin# |sin# 



Now we arrange to put the fixed point at the end of the hand. 

/ cos#\ _ / cos0\ ( e \ 



cos 2 9 sin 9 cos 9 
~ 3 3 
sin 2 9 sin 9 cos 9 



f — sin i 

We now let 9 h = n/2 - (n/6)p be the hour angle, which increases clockwise from tt/2 so that it reaches In when p = 12. 
We let 9 m = tz/2 - (2/n)p which travels the same direction but reaches 2n every hour. Having arranged that 

w m (x, y) = A\ e=9m (^j + ^ e j^j 

will be the map to the minute hand. The map to the hour hand is similar, but we must shorten the hand by a factor of 2. So, 

^,,)=^(;) + (; : ) 

where A h — ^A and 

cos 2 9 sin 9 h cos 9 h 



e = cos 9 h — 
f = sin9 h - 



6 6 
sin 2 9 sin 9 h cos 9 h 



Answers to Chapter IV, section 1 

IV.1.1 The addresses are a = 2121 122, b = 333, and c = 12231222. the height, and half the width, and then, depending on 
the map, moves it to the appropriate corner of the square. To shrink the square by one half on each side, we define the map 



and assign the symbol 0 to an iteration of this map. To get the quarter labelled 1 , we must do the same shrink, and move the 
lower left corner over by 1 /2 in the x direction, which gives the map 



An iteration by w 2 corresponds to an address symbol of 1. The third map must move the lower left corner to (1/2, 1/2), 
consequently we define 
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An iteration by w 3 corresponds to the address symbol 2. Finally, the last map moves the lower left corner up by 1/2, giving 
and an iteration by this map gets the address label 3. 

IV.1.4 The easiest way to see this problem is to look for a subset of the IFS maps which have as their attractor one of the 
sides of the Sierpinski triangle. Consider the IFS {1R 2 ; w u w 2 } giving rise only to addresses which contain no 3's. If a point 
is on the a; -axis, neither of the two IFS 



C)-G DC) 
(;)=a ?) + G 



Either of these two will return a point on the x-axis to the x-axis, and their attractor is the line between 1 1 and 22. On the 
other hand, any point with a y component not equal to zero will on a single iteration have a new y value equal to half the 
original, which is still non-zero. Hence this edge of the Sierpinski triangle consists of all points whose addresses contain 
only 1 and 2. 

We may use identical arguments to verify that the points along the left edge of the triangle have addresses containing only 
l's and 3's. The final edge contains all addresses containing only 2's and 3's. 

IV.1.6 One scheme is to use the Cartesian product of the two codespaces. Let the IFS with condensation be {X; w 0 , w u ..., 
w N } and the IFS whose attractor A is used for the condensation set {X; u,, u 2 , . . . , v M }. The attractor B for the IFS with 
condensation is then the union of the successive iterates 

oo 

B = |J W n (A) 

Any point on B may be described by giving the sequence of iterates done on A to get to the particular image on B of A, 
combined with the address on the image, which corresponds to an address on the attractor A. Consequently, we have two 
code space strings, possibly on two code spaces, which give the whole address. Any address can occur. 
IV.1.8 This is part of the proof of Theorem 1 of section 4.5. Let {X; w u ...,w N ) be the IFS in question, let its contractivity 
be s, and let its attractor be A. Consider the images of A for two points which are near by in code space. That is let some 
small e > 0 be given, and let x = x x x 2 x 3 ... and y = y,y 2 y 3 • • • be two code space points d(x, y)<€. The points in A having 
these addresses are contained in the images 

x e w Xl o w X2 o ■ ■ ■ w Xn (A) 

y € U>j| o o • • • Wy n (A) 

Now, for e small, we may find a number n such that 

1 



(N + 1)" 

which means that x, = y, for all i < n. Then both points x, y are contained in 

w xx ow X2 o--- w Xn (A) 

since the contractivity is s this image must be smaller than s n ■ diam(A). Hence the closer the addresses in code space, the 
closer the points having those addresses are on the attractor. 

IV.1.12 The scheme which is used in Figure IV.99 is the following: Write the numbers with the same denominators in 
horizontal lists beginning with the denominator 1, and putting each successive denominator under the list before. Starting at 
the upper right hand corner, go down if we have just arrived at the left edge, go right if we have just arrived at the top edge, 
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go up and to the right if we are leaving the left edge, or have just gone right and up, go down and to the left if we are leaving 
the top edge or have just gone down and left. This winds us through the rationals as shown below: 



I 2 3 4 5 

1 11 11 

IS / / / 

1 2 3 4 5 

2 2 2 2 2 

/ / / 

1 2 3 4 5 

3 3 3 3 3 
i / / 

12 3 4 5 

4 4 4 4 4 



There are two things to note: The scheme works regardless of the entries as long as every rational is in some horizontal 
list, with the lists ordered in a countable manner, and, all redundant terms are not in lowest terms. Since any horizontal list 
becomes 'shorter' but still infinite if such terms are removed, we may use the same scheme if we agree to write down only 
those numbers in lowest terms in each horizontal list. To get the negative rationals, put the negative list corresponding to 
each positive one in below it. That is, the second row is numbers with denominator — 1 the fourth with —2 and so forth. This 
method of converting the rationals to a linear list indexed by the counting numbers was invented by Cantor. 

IV.1.13 The triangles in the Sierpinski triangle can be labelled by size: There is 1 triangle of size 1, 3 of size \ and in 
general 3" triangles of size ^ since on each iteration of W(A) we generate 3 times as many triangles of | the size. Suppose 
we label each of the triangles with size ^ with the labels 

3" + 1,3" +2,..., 3" +3" 

Since this last term is 2 ■ 3" < 3 n+1 , each is a unique label, there is a label for each triangle, and the collection of labels is a 
proper subset of the positive integers. The number of triangles is therefore countable. 



Answers to Chapter IV, section 2 



IV.2.1 The IFS {R; \x + ±} is just touching. Let A be the attractor, and let O be the set 

0 = (O,i)nAU(i,l)nA. 



Then we have 



which are disjoint. By construction 



«fi(O) = (0,j)U(^) 
4 4 2 

u> 2 (0) = (i ^)U(5 1) 



Furthermore, O is open because it is the finite union of open intervals. The IFS is not totally disconnected because it has the 
attractor 

A = [0, 1] = [0, 1] U [i, 1] = Wl (A) U w 2 (A) 

and ifi(l) = w 2 (0). 
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IV.2.2 The attractor of the IFS {R; \x, f jc + J} is [0, 1] as follows: 

u>i([0,l]) = [0,^] 

^ 2 ([0, 1]) = 1] 
4 

iw,([0, l])Um 2 ([0, 1]) = [0, 1] 

Since there is a unique attractor with the property that W(A) = A this is it. Furthermore, there is an open set (J, |) interior 
to the overlap between the images, that is 

11 11 
(-, -)Cui 1 (A)nu) 2 (A) = [- ) -] 

so the IFS is overlapping. 

IV.2.3 The first thing to show is that the attractor of this IFS is [0, 1], as follows: For any affine transformation f(x) = 
ax + b on IR, an interval transforms to either 

J [ac + b,ad + b] if a > 0 
lC ' [ [ad + b,ac + b] ifa<0 

For each w n here, a = ^ > 0, so we have, 

n - 1 n , 
^([0.1)) = [— .-] 

Thus the union of these is 



io t 9 

W([0, 1]) = |J w n ([0, 1]) = [0, — ] U • ■ ■ U [-, 1] = [0, 1] 



10 10 

Since the attractor A is unique, this is it. The IFS is clearly not totally disconnected because w n ([0, 1]) n w n+i ([0, 1]) = {^5}. 
However, let 

0 = M ( !LLL,i )01 
^ v 10 10 

Then 

w n (0) C u,„((0, 1)) = (^-, ^)cO 

and for any i ^ j 

w,(0) n w/O) c Wi((0, l)) n u;,((0, l) = ^) n (^-, ^) = 0 

So the attractor is just-touching. 

IV.2.4 Our IFS is {[0, 1]; w x = \x, w 2 (x) = \x + §}. The attractor A is a subset of [0, 1], and wi([0, 1]) = [0, |] and 
u> 2 ([0, 1]) = [§, 1] so that wi ([0, 1]) n iu 2 ([0, 1]) = 0 Then 

n w 2 {A) C wi([0, 1]) n iu 2 ([0, 1]) = 0 

so the IFS is totally disconnected. 

IV.2.6 We have w x (x) = \ and iw 2 (jc) = \. The attractor is clearly A — {\\. The images of the two maps intersect, so the 
IFS is not totally disconnected. Furthermore, since the entire attractor is in the intersection there is clearly no open set O 
satisfying the conditions for a just-touching IFS. Consequently the IFS is overlapping, by default. 
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IV.2.8 The smallest number with no non-zero entries in its decimal expansion is . 1 . Any other choice, with non-zero entries, 
must have at least one digit which is greater than 1. Reducing this digit to 1 produces a smaller choice. Hence reducing all 
of them to 1 produces the smallest. 

IV.2.10 Let {X; w x , w 2 , ■ ■ ■ , w N } be our IFS with attractor A. We first prove / -<=>■ //. Let x be a periodic point. Then by 
definition there exists a finite sequence of transformations w a{ , . . . , w„ n such that 

w„ x o • • • o w an (x) = X 

If we define f(x) to be this transformation, we have 

f(x) = f m (x) = X = {W a{ O • • • O W an ) O ■ • • O (W ai O ■ ■ • O W an )(x) 

consequently x has the address <xi • • • <x„, which is a periodic address. Now suppose that x has the periodic address o x ■ ■ ■ a n . 
Then x is given by 

4>(a { ■ ■ ■ a n ) = x = <p{o\ ■ ■ ■ o n O\ ■ ■ ■ a n ) 

= w a] o • • • o w an <f)(o { ■■■<J n ) = w ai o • ■ ■ o w an (x) 

so x is periodic. We next prove / ///. The semigroup of transformations generated by {wi, . . . w N } includes all finite 
compositions of the maps for the IFS. In particular since w a{ o ■ ■ ■ o w an is a finite composition of these maps, it is an element 
of the semigroup. It's fixed point is x. Now suppose that x is a fixed point of an element of the semigroup of transformations 
generated by these maps. Then by definition of the semigroup, there is an element of the form w a[ o ■ • ■ o w„ n with the 
property that 

Consequently x is periodic. 

IV.2.11 We wish to show that any periodic point of the IFS {[0, 1] :\x,\x + \) is of the form p/(2 n - 1). This is certainly 
the case for the two fixed points 0, 1. Choose any point x in [0, 1]. Then we claim that after any sequence of k operations, 
we have 



1 

2*' 



] Jo k (x) = — b x + C 



for some number C. This is true since each iteration multiplies x by | and multiplies any constant term by the same factor, 
and possibly adds another part to the constant term. If x is k periodic then we have 

1 C2 k 
x = —x + C or x = 



2 k (2 k - 1) 

Now C is composed by taking the constant term from the last iteration and multiplying it by | and either adding a half or 
adding 0. Thus the binary expansion of C contains at each digit a 1 if ^ was added on that iteration, and a 0 if it was not. 
Hence C has a binary expansion of length k, which, multiplied by 2 k gives any possible number from 0 to 2 k — 1, as desired. 
IV.2.14 The IFS vo x (x) = 0, w 2 (x) = ^x + ^} is an IFS with condensation, the condensation set is {0} and consequently 
the attractor A is given by 

oo 

A = |Ju; 2 "(0)U{l} 

n=0 

Any point p e A is periodic since there is some k for which p = w k (0) unless p = 1 which is a fixed point and therefore 
periodic. But this means that 

p - w k (wi(p)) = w k (p) o w { (p) 

so p is k + 1 periodic, a totally disconnected IFS is invertible, it follows that the cardinality of A is the same as that of 
its related code space. But code space on more than one symbol is uncountable by Theorem 1.1 in Chapter IV. Hence the 
attractor is too. 
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IV216 Any uncountable attractor which is overlapping has uncountably many points with multiple addresses^ This is 
because it contains an open set of points with multiple addresses which maps to an open set in code space by 0 . Any 
open set with non-empty interior in code space on more than one symbol is uncountable. 

IV2 17 One example of such an IFS is any overlapping IFS of two maps on the interval [0, 1] such that the end points of 
the overlapping region w\(A) n w 2 (A) are 3-periodic. This causes some of the sets of 4 symbols in any given position to 
map to the same point on the attractor, allowing uncountably many addresses for some points. 

IV.2.18 The set of interval end points in C is equal to exactly twice the number of intervals used in the derivation. Following 
the proof of the countable number of triangles in the Sierpinski triangle, (exercise 1.13 in Chapter IV), we have at each 
subdivision. 2" intervals added, and we number them 3" + 1, 3" + 2, . . . , 3" + 2- < 3-. Since this numbering scheme 
produces an integer in order for each interval without repeating, the intervals have been put into correspondence with 
^ positive integers. Then their end points can too, since we may give all the left-hand end points the negative integer 
corresponding to the right hand end point labelled as above. Since the integers are countable so are the endpoints. 

Answers to Chapter IV, section 3 

IV.3.10 We first show that the point 12l2 is a periodic point of period 2. 

/(1212) = 212 = 2121 
/ 2 (12l2) = /(2121) = 127 = 12T2 
so this is_a periodic point of period two, and the other element of the cycle is 2121. Next we show that it is repulsive. Let 
x f = 1212, and y € S. Then 

d(f 2 (xf),f 2 (y))=d(x f ,f 2 (y)) 

^ |*,--y«l _ a - y l*-y«l 
-2^ 30-2) ~ J 4-i y 

= 9 d(x f , y) - (3|1 - v,| + |2- y 2 \) 
If we choose J > e > 0 then for y € B e (x/) we have y , = 1 and y 2 = 2, so for any such e 

d(x f , f 2 (y))>9-d(x f ,y) 
and the point is repulsive, the cycle is therefore repulsive too. 

IVJ.11 The dynamical system {[0, 1]; 2x(l - x)} has an attractive fixed point at x f = \. /(0) = 0 so that 0 is also a fixed 
point of the system. To examine the behavior near 0, we first notice that if x * 0 then as x -> 0 we have (1 - ■ x) -* L 
Consequently, as we approach 0 the function approaches /(*) - 2x. Since for any x > 0 we have d(2x 0 = Id (x, 0) > 
d(x,0) it is easy to find an € such that the function is repulsive on an ,-ball around 0. For example, let e - .1. Then 
d(0,f(x)) > 1.8^(0, x) for any x € B ( (0). 

IV.3.14 We want the behavior of the dynamical system {R U {00}; f(x) = Xx} in four cases: 

(i) X = 0 gives us f(x) = 0 which makes 0 the only fixed point, and attractive. 

(ii) 0 < |X| < 1 • 00 is a repulsive fixed point since < likewise, 0 is an attractive fixed point. 

(iii) X = - 1 0 is a neutral fixed point, as is 00. Every other point is period 2, and neutral. 

(iv) 1 < X < 00. Since > x for any x * 0, 00, 0 is a repulsive fixed point, and 00 an attractive one. 



IV.3.17 We have 1 

J x + 1 ifx <0 
/W + ! ifx>0 
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If x = 0 then f{x) = 1 and f 2 (x) = 0 so that {0, 1} is a cycle of period 2. For any other x split x into its integer part and 
fractional part, that is 

x = n + x \x\ < l,n eZ 

Then if x < 0, f n (x) = x. which is between -1 and zero, and f n+] (x) — 1 + x which is between 0 and 1. Applying / to this 
number gives f(l+x) = —x, which is again greater than zero, and 

f 2 (l+x) = f(-x) = l+x 

which is a cycle of period 2. Consequently for any x < 0, x is eventually periodic of period 2. For x > 0, we have 

f(x) = f(n + x) = -(n-l)-x 

so that redefining n := — (n — 1) and ic := — x. gives us the case just examined, and these points are eventually periodic of 
period 2 as well. 

IV.3.18 If {X; w\, . . . , u>jv} is a hyperbolic IFS, we may assume a contractivity s < 1. By Theorem 7.1 in Chapter III, the 
dynamical system {H(X); W] possesses a unique fixed point A e H(X). Also by Theorem 7.1, for any B,C 6 H(X) we 
have h(W(B), W(C)) >s h(B, C). Consequently, 

d(W(A), W(B)) = d(A, W(B)) >s-d(A,B) 

and the fixed point A is globally attractive (globally attractive means e may be chosen any size). 

IV.3.20 We need a small lemma to proceed. If a space is compact, then for any e > 0, there is an e-net {y,-}"! 6 / of points 
such that every point in the space is less than e from some element of the net. We need to produce a sequence of such nets. 

Lemma: Given €\ > €2 and an € r net {yi}"^^ there is a minimal e 2 -net {Zi)"=f such that C {z, }. 

Proof: There is always some e 2 -net which has the e r net as a subset, since given any e 2 -net {Zi}"= 2 \ the finite collection 
of points 

is an € 2 -net containing the ei-net. Now taking the collection of all e 2 -nets containing this particular ei-net, there is an 
"(^2) > "(fi) which is the minimum size of any of the nets in the collection. Let {z,}"!^ ^ e one °f these. 
Now suppose that our code space has the metric 

, \xj-yi\ 
i=\ K 

with some k > 1, perhaps k = N + 1. Let e, ■ = -i, and, with the help of the above lemma, construct a nested sequence of 
6,-nets, 

{yiih={ cb> 2 ,.}, =1 2 ciy^iJ c--- 

We build a sequence of points [xj}^ by letting 

x J = y\ l --- y\ j+1 yi ] ■ ■ ■ yi j+] ■ ■ ■ ■ ■ ■ yn( fj ) j+l 

By construction, Xj visits each e 7 ball around each element of the e 7 net by repeated applications of the shift transformation 
T. But code space is compact, and therefore the sequence {Xj}^ =] must have a convergent subsequence, which converges to 
a point x which visits every neighborhood in code space. Hence x has a dense orbit under the shift transformation. 
IV.3.21 Let A, B be two open sets in code space. Choose an e > 0 and points a A 6 A, and a B 6 B, such that B f (a A ) c A. 
Then if the metric for £ is 

\xi - yi I 
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there is an M such that for any <reE such that 

g x = a A{ , a 2 = a Al , ■ ■ ■ a M = o AfA 
we have o € B € (a A ) C A. Then we only need to choose 

o = o\ x ■ ■ ■ °a m vb 

which is in A, but T M {a) =a B e B. Since by choosing the sequence of ct's which agree on M, M + 1, M + 2, . . . first 
symbols with a A , followed by a B , as above, we get that for allm > M there is an element of A such that T m (a) e B, T is 
mixing on code space. 

Answers to Chapter IV, section 4 

All the problems are examples or figure exercises. 

Answers to Chapter IV, section 5 

IV.5.1 Let {Xi; f\] and {X 2 ; f 2 ] be equivalent dynamical systems with the equivalence provided by the homeomorphism 
9 : Xi — ► X 2 . Let {x\, x 2 , ■ ■ ■ , x p ) be a cycle of period p in Xi. That is fC(Xj) = and 

' x ;+ i if i # p 



fdXi)= [ Xl ifi = p 
Then 

Md( Xi )) = 9of x o9- x o 6{ Xi ) = O(fdXi)) 

and 

//to) =flo/ 1 or 1 o^o---o/ 1 or l o e(Xi) = e{xi) 

so that {d(x\), . . . 9{x p )} is a cycle of period p. To show that the statement holds in reverse, define the homeomorphism 
6 2 : X 2 — ► Xi by 0 2 = and repeat the above argument. 

The second statement, regarding attractive fixed points, is highly dependent on the restricted definition of the term given 
in the text, and, since it is not preserved by homeomorphism must be dependent on the metric. Choose two copies of the real 
line, (K, d x ) and (K, d 2 ) where d\(x, y) is the usual metric, and d 2 (x, y) is given by 



J|3*-3>>| if*<0 
d 2 (x, y) = \ 

[\x — y\ nx>0 



The identity map is then a homeomorphism between these spaces. Now choose the function f(x) = — \x for both systems. 
In (K, d\), 0 is an attractive fixed point. For (K, d 2 ), no matter what the size of the e ball is around 0, it will include points 
on the positive side that, because the ball is shrunken on the negative side (for example, fli(0) = (—1/3, 1)), fail to land 
back in the ball on one iteration. Hence this is not an attractive fixed point, even though the two systems are equivalent. 
IV.5.2 An eventually periodic orbit is has the orbit {x x ,x 2 , . . . , x n , x n+x , . . . x n+p = x n+ \), for some n and some period p. 
As in exercise IV.5.1, we write 

f( r \-l Xi+l iU ^n + p 
MXi) -\x n+{ if i=n + p 

The argument is then identical to that in IV.5.1. 

IV.5.3 We modify the example in IV.5.1. Let (K, di) and (K, d 2 ) be as in the solution to problem IV.5.1 . Let the homeo- 
morphism 9 : R -> M be 9{x) — x. Then if f x (x) = -2x, under our homeomorphism we have f 2 (x) = -2x. However, in 
(K, d\) 0 is a repulsive fixed point, but given any e > 0 a point x < 0 such that x e B € (0) moves under 1 iteration to a point 
which is closer to 0 under the d 2 metric than it started. 
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IV.5.4 9 is a homeomorphism, which means that 9 and 0~ l are continuous. This implies that 9~ l (0) is open if O is open 
and (0" ] )" l (O) — 9(0) is open if O is open. Consequently, if there is an open neighborhood of x f in Xi, then 9{0) is open 
in X 2 and 9(x f ) e 9(0) implies that there is a neighborhood C of 9(x f ) in X 2 such that C C 0(O). Suppose that for any 
yeOwe have 



lim f?{y)=x 



Then since 9 is continuous 



Hm f 2 "(9(y)) = lim 0(/, n (y)) = 9(x f ) 



for all 9{y) e 0(C), in particular for all elements of C. Then x 2f =9(x/) is an attractive fixed point of f 2 . Note that this is 
the standard definition of an attractive fixed point. We will use the more restrictive version given in the text, however. 
IV.5.5 The solution above to IV.5.4 shows that this definition of attractive fixed points is preserved under equivalence and 
conjugacy. (and in fact does not require that the spaces be metric spaces). All that is needed is to give a similar definition 
of a repulsive fixed point. For invertible maps, this is done by saying that X/ is a repulsive fixed point if x f is an attractive 
fixed point of the map /"\ that is, there is a neighborhood O of x f such that for every y e O, lim^co f~"(y) — x f . For 
non-invertible maps, this is extended by saying that there is a neighborhood O of Xf such that for every y e O there is an n 
such that f n (y) & O. This is likewise preserved under equivalence and conjugacy, a moment's reflection says that if this is 
true for O, then it is true for any neighborhood 0 { C O. 

IV.5.7 We want to show that if 9, a homeomorphism establishing the equivalence of {[R; /} and {R\ g) is a diffeomorphism, 
then the definition of an attractive fixed point given in the text is preserved under the equivalence. Let x f be an attractive 
fixed point of {[R; /}. The definition here establishes that a fixed point is attractive if and only if it is a contraction on some 
e-ball around x f . We let 9(x f ) — x g be the corresponding fixed point of {[R; g}, and we will show that g is also locally 
a contraction. Choose e > 0 such that / is a contraction on B ( (x f ). Then for y e B ( (x f ), there is an s, 0 < s < 1 such 
that \ f(Xf) — f(y)\ < s\xf — y\. Since 9'(Xf) ^ 0, there is an e small enough (the derivative is continuous by assumption) 
such that for any y e B ( (Xf), the following expression is well-defined (i.e. none of the fractions are equal to zero in the 
denominator) for y ^ xf. 

9(f(x f ))-9(f(y)) 



f(Xf)-fiy) 



i f - y 



9(x f )-9(y) 



In the limit as y x f , the two fractions on the right approach, respectively, \9'(x f )\ 
close enough to Xf, we have 



9'(x f ) 



= 1. But since / is a contraction 



'i-y 



f(x f ) - f(y) 



> - > i 

s 



which implies that 



9(f(x f ))-9(f(y)) 



9(x f )-9(y) 



< s 



Then for e > 0 close enough to 9(xj), we have 

\0(f(x f )) 



9(f(y))\<s\9(x f )-9(y)\ 



so that|g(^) - g(z)\ SslXg 



for z close to x„, and the fixed point is attractive because it is a local contraction. 



IV.5.8 Assume /} is a dynamical system such that / is differentiate for all x e 1R. A web diagram for a given initial 
condition jc 0 starts at the point y = x 0 , and successive points are then taken from (x n , x n ) m> (x n , x n+ \) m> (x n+ \, x n+ i) where 
x n+] = f(x n ), and the points of the form (x n , x n+ \) are on the graph of the function f(x). It follows that if a point is fixed, 
x n — x n+l = f(x n ) so that this is a point of intersection of the graph of f(x) with the line (x, x) or y = x. 

To prove the second assertion, we note that for our definition of attractive fixed points we have an e > 0 such that for any 
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y e B ( (x f ), we have a constant 0 < s < 1 such that \f(y) - f(x f )\<s\y - x f \. Then for y # x f we may write 



1/00 -/(*/)! 



/O) - /(*/) 



y ~ x f 



\y-x f \ 

Since this is true for all y e B ( (x f ), it is certainly true as y -> x f , yielding 

/O) - /(*/) 



< s 



lim 

y-+ x f 



y-xf 



= \f'(x f )\<s < 1 



To go the other way, if the derivative is continuous in a neighborhood of x f , and \f'(x f )\ < 1, it follows that for some 8 > 0 
there is a neighborhood fi^x/) such that for any y e l/'OOl < 1. We may then find an 0 < e < 8 and a number 

5 < 1 such that l/'OOl < ^ for all yeB f . Then the map is a local contraction since 



lim 

y-*yo 



f(y) - /Oo) 



y - yo 



< s 



implies that close enough to y 0 , we have 



l/0)-/Oo)l <s\y-y 0 \ 



in particular, I/O) - f( x f) \ < s\y - x f \, as desired. 

Finally, to generalize to cycles, let x u x 2 , . . • , x p be a cycle of period p, and suppose that the cycle is attractive. By 
definition, this implies that x, is an attractive fixed point of the dynamical system {R; f p ], for some x, in the cycle. By the 
result above, we have |(/ p )'(x,)| < 1 for this point. This says that 

l(/ p )'0,)l = l/'O,) ■ • • /'Up)/'(*.) • ■ ■ /'U/-i)l < 1 

Notice if this condition holds at x, it holds at any xj in the cycle since the expression is the same product. If this product is 
less than 1 , it follows that for at least one of the elements in the product | f'(xj)\ < 1 , but it can also be the case that some of 
them do not satisfy this condition, only their product must. 

IV.5.9 Since we would like our IFS to have an associated shift dynamical system equivalent to 

x e [0, \] 



fix) 



ri-2* 

\2x - 1 



xe[\,\] 



we look at an IFS which is the inverse of f(x) on the regions in question. Consider the IFS 



1 1 

{[0,1]; -* + -, 



The attractor of this system is [0, 1] as follows: 



W([0, 1]) = ^([0, 1]) + ^U -i([0, 1]) + \ 

111 1 

= [0,-] + -U[--,0] + - 

= [i,l]U[0,^] = [0,l] 



That this IFS is just-touching results from the intersection above containing the point \ , and the fact that O = (0, \ ) U ( \ , 1 ) 
suffices for the open set. The points of multiple addresses are the images of this point. Denning the associated shift 
dynamical system, we have 



\w x \a) ifae(±, 
bKa) -\w- 2 \a) ifae[0, 



L 
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The point of intersection yields 

TV" 2( 2"2 )=0 

so we define 



and there is no conflict. This explicitly gives 5(a) as 



,2a -I ae[\,l] 
S(a)={ l-2a «E[0,i] 



which is clearly equivalent to f(x) with the homeomorphism being the identity on [0,1]. 
IV.5.10 When p = 0we have 



Wi 



Wi 



Wi 



.(;)-« 5)0 

O-O 5)C) + 0) 
(3-0 !)GM» 



If we let )> = 0, then 



WiU,0) = (^,0) 
4 

U>2(*,0) = (^* + |,0) 

4 8 

Wi(jc,0) = (^ + ^,0) 
4 4 

so that for any point on the ;c-axis, Wi(x, y) is also on the ;c-axis. We now consider the image of the set [0, 1]. This maps 
to three subintervals, each of length \ and arranged with their left endpoints at 0, f and \ , respectively. The set formed by 
repeated iteration removes two subintervals from each interval formed by the previous iteration, and the attractor must be 
lim n ^oc W"([0, 1], 0). Thus for p = 0, the attractor is a Cantor set. 
For p — 1 we have 



Wi 



o=a DC 

(;)-« ;)(;Wi) 

o=(o dm 



which is a just-touching Sierpinski triangle. 

We next prove that for fixed aeE, the map <p(p) A(p) is continuous in p, for p e [0, 1]. We may write the three 
transformations in matrix notation as u>, (-*) = Ax + t ( , where 



Selected Answers 481 



A 



U 

so that we have 

itVj o w a2 (x) = A 2 x + At a2 + tu j 

in general, we have 

w ax o w a2 o ■ ■ ■ o w an {x) 

= A"x + A"- X t an + A n -\ n _ x + --- + At a2 + t ai 

so that 

n oo 

<t>(p) = lim A"x + £ = E A Vi 

Writing this out for fixed a, and letting 

- (/;) 

we have 

(pAp) = I ™ (1±£V , I 

which are continuous functions of p with a fixed (they are, for fixed a, differentiable in p, being a form of Taylor series 
expansion). 

That the set J{x) = {y e A(0) : d(y) = jc} where 0(jc) = 0(1)(0(O) _1 (jc) is the set of y whose paths meet at x is true by 
construction. 

Answers to Chapter IV, section 6 

IV.6.3 The three points in question have addresses 13 = 3l, 12 = 21, and 23 = 32. Defining 5(a) = w^(a) forces the point 
a to become 12, which is eventually periodic since T(\2) = 2, a fixed point. Similarly, if we define 5(a) = w 2 l (a) we force 
the point a to become 21 which since T(2l) = I is also eventually periodic. The points b, c are analyzed similarly, being 
forced to either of the two addresses, which after one iteration land on the fixed points 2 or 3. 

IV.6.4 If we create an IFS for the equilateral Sierpinski triangle in which we rotate the subtriangle corresponding to w 2 (A) 
by 240° and that corresponding to iu 3 (A) by 120° we get a just- touching Sierpinski triangle in which the multiple address 
points correspond to equating the addresses 

13 = 33 3l = 2l 12 = 22 

Since these points, regardless of which map is used in the equation 5(a) = w^ l (a) map to the points 3, 1, and 2, on one 
application of the map, 5(a) may be chosen without conflict. Specifically, the IFS in question is 



"V o ^) 



(0, 0) 

y 8 ' 2> 

(^,0) 



= 1 

= 2 
= 3 
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w 2 



o-tt nc 

(x\ = /icos240° -Isin240°\ ( x \ / l\ 
\y J \ \ sin 240° | cos 240° / \y ) \ 0 J 

(;)-( 



2 cos 120° -| sin 120° \ /x w 4 



m \ v I - I I sin 120° i cos 120° ^ ^ / V ^ 

IV.6.6 The IFS {E; ?i, . . . t N } where t n = no has by definition an associated shift dynamical system defined by S(a) = 
t~ l (<r) where a — o x a 2 ■ ■ ■. Then S(o) = a 2 a^ ■ ■ ■, which is exactly T (a). To see whether two such dynamical systems can 
be equivalent for different values of N, we use exercise IV.5.1, namely if two dynamical systems are equivalent, then a cycle 
of period p maps to a cycle of period p. Given N\ > N 2 , we have for either that any of the cycles of period 1 are given by n 
where 0 < n < /V, . These are distinct, and if 0 were the homeomorphism establishing the equivalence, would map to distinct 
fixed points. But this system for has /V 2 such points, which is less than N\. Thus 

9 : Etfj —> £/v 2 

either does not map fixed points to fixed points or is not one-to-one, in other words there is no such equivalence. 
IV.6.14 First we prove, as claimed in the text, that 

A = {(0(a), a) :o e E} 

is the attractor of the lifted IFS. To do this we need to confirm hyperbolicity, in order to assert the existence of a unique fixed 
point in 7i(X x E). Let the metric on X be denoted by d x and that on E be denoted c/ E . To make things simple, we use a 
Manhattan metric on the product space, namely, 

</((*i, ai), (x 2 ,a 2 )) = d x (xi,x 2 ) +d< £ (a ] ,<j 2 ) 

We have, by definition, 

ibi((x, a)) = (wi(x), ia) 

and we must show that this is a contraction. u>,- are the maps of the original IFS, and we may assume that that system has a 
contractivity of s. Then 

1 ~ -az.-l 
d(Wi(xi,ai), Wi{x 2 ,a 2 )) =d x (wi(x{), u;,(x 2 )) + > 

/V + 1 j-f (N + l)J 

1 

< sdx(x u x 2 ) + xr , dz(au a 2 ) 
N + 1 

Letm = max(s, j^)- Then 0 < m < 1 and 

d(Wi(x\, <7i), tdi(x 2 , a 2 )) < md({x\, (J X ), (x 2 , a 2 )) 

so each map iD, is contractive, and the IFS is hyperbolic. Consequently since X and E are complete, and therefore so is 
X x E, we have the attractor A is the unique fixed point in H(X x E) of the map 



W(B) = \Jwi(B) 



for B a compact, non-empty subset of X x E . We now verify that the attractor A is as given in the text. For any element of 
the set A = {(0(a), a) : a e E} and some iD, we have 



Wi(A) = {(Wi{4>(o)), ia) : a e E} 
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but iff = </r V.WKtf))) so the image W(A) is in A. Let a e A. Then a is given by 

a = (u; ai (r(a)),a 1 r(a)) 

so a is in the image of an element of A under w 0l . Hence W(A) = A and this is the unique attractor of the lifted IFS. Finally, 
to complete Lemma 1 , we have 

W\(A) = {a e A : a = (x, lcr)} 

w 2 (A) = { a eA:a = (x, 2a)} 
These sets are points with mutually exclusive sets of second coordinates, so their intersection is empty. Hence A is totally 
disconnected. , 
IV.6 15 Let be any orbit of the associated shift dynamical system {A; S}. Then by definition, S(x n ) = x n+x = u>, (x n ) 
for some w h Thus, regardless of the rule for assigning S(a) on overlapping regions, there is a rule of assignment which 
assigns either w~ x x or w~ 2 x to each application of S to the orbit. Let x be the address such that if S(x„) = w, (x n+i then 
r„ = i . Let 

Then0(ff n ) = x n , a n+x = T(a n ), so that the orbit {(*(*„), a n )} of the shift dynamical system {A; ~S} has the desired property. 
IV.6.16 By definition, the lift is (a u a 2 ) h> (Ta u a h a 2 ) which looks like the shift automorphism: it takes the symbol off 
the first element and puts it on the second. The suitably defined inverse would then be to take a symbol off the second 
element and put it back on the first. This defines a system on E x E, as the space of shifts, which is identical to the shift 
automorphism. 

Answers to Chapter IV, section 7 

IV.7.2 x 0 = 212 consequently the true orbit of x 0 is given by 

f 0 = 212 T(x~ 0 ) = \2 T\x~o)=2 
T"(x 0 ) = 2 forn>2 

By contrast, we have x n = 12 after the first iteration, so that d(x~ n , T»(x~ 0 )) = 1/3 after the second iteration, under the code 
space metric for two symbols. We wish to find a shadowing orbit. Taking B = 1 /3 and s - 1 / 3 we need 

s9 1 

d(x n , x n ) < — = - 

\ — s o 

Any real orbit which hasthe symbol 2 in it will eventually be separated from 12 by at least 1/3 so the only possible choice 
for a shadowing orbit is 1. Computing the distance yields 

^2.D=3^F = T3I- 1 = 6 

which satisfies the shadowing theorem. 

Answers to Chapter IV, section 8 

IV.8.1 It suffices to show that the positive rationals are dense in the positive reals, since the function 9(x) = -x will show 
that the negative rationals are then dense in the negative reals. To this end let r e R, and let its decimal expansion be 
.H r 2 r 3 .... Then the sequence of rationals 

•nO, .rir 2 0, .nr 2 r 3 0, . . . 

converges to r. Hence every number r e R is a limit point of the rationals, and since the closure of a set contains its limit 
points, the closure of the rationals is the reals. Hence the rationals are dense in R. 
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IV.8.2 The sequence in question is an enumeration of the rationals in [0, 1], i.e. Q D [0, 1]. Since Q is dense in R, Q n [0, 1] 
is dense in [0, 1]. 

IV.8.3 The function f(x) = 2x mod 1 can be viewed as follows: On the circle of radius l/2n, that is, of circumference 
1, define x(0) to be the distance along the circumference in the counterclockwise direction from the positive x-axis. Then 
taking the usual equation of angles 0 = 0 + 2n = 0 mod 2n , the function 0 k» 20 gives us our function on [0, 1]. If 0 is not 
a rational number times 2n, it cannot be return to itself under this map, since 2"6 = 0 implies that 2" 6 = 0 + mn for some 
integer m. Hence any irrational number times 2iz for 0 results in a dense orbit on the circle. 

IV.8.4 The dynamical system {[0, oo); f(x) = 2x] is sensitive to initial conditions as follows: Choose 5 = 1, and for any 
€ > 0 let y jz x , and y e B f (x). Then since d(f(x), f(y)) = 2d(x, y), we have d(f(x), f n (y)) = 2 n d(x, y). Choose n, 
given y such that 2 n d(x, y) > 1. In particular, for any n > N where 2 N > l,d(f"(x), f"(y)) > 8. 

The dynamical system {[0, oo); /(*) = .5x) is not sensitive. In this case, we have d(f(x), f(y)) = 0.5d(x, y) so that for 
any e > 0 and any 8, there is an N such that for any y e B e (x) d(f n (x), f n (y)) = (0.5) n d(x, y) is less than 8. 
IV.8.5 Let £ be the code space on two symbols, and let T be the shift operator. Then {£; T] is sensitive to initial conditions: 
Choose 8 <\. Then if x, y e £ and x ^ y, there is an n such that x n ^ y n . Then since 

T"(x) = T n (xix 2 ...) = x n x n+x . . . 
T n {y) = T n {y x y 2 ...) = y n y n+x ... 

it follows that 

d(T n (x), T n (y)) > ~ ywl > 8 

Hence whatever e is chosen, we may find a y and an n satisfying sensitivity to initial conditions. 

{£; T] is transitive as well: Let U, V C £ be open sets. Then we can choose y e V and 8 > 0 such that B s (y) C V, and 
x eU and e > 0 such that B e (x) C U. Choose n such that jn < e. Then any v e £ such that 

v — x x x 2 . . . x n v x v 2 . . . e B f (x) c U 

Similarly, choose M such that for any m > M, 

" = y\y2 ■ ■ ■ y m u\u 2 ...e B s (y) c V 

Then any a e £ of the form 

° = y\yi ■ ■ ■ y m x\x 2 . ..x n a x a 2 . . . 
is in V. T"(a) = x x x 2 . . . x n o x o 2 . . ., which is in U. Hence T n (V) D U ^ 0, as desired. 

IV. 8.6 Suppose that {X; /} and {Y; g} are equivalent under the homeomorphism 0 : X Y. Let U, V C Y be open sets. 
Then because 0 is continuous, 0~ l (U), 0~ X {V) are open sets in X. Since {X; /} is transitive, there is an n such that 

0-\U)n f n o0-\v)^& 

Let x be a member of this non-empty intersection. Then 0{x) eU and x e f" o 0~ X (V) implies 

0 o 0~\U) n 0 o f n o 9~\V) = u n g n (V) ^ 0 

so that {Y; g} is also transitive. Reversing the roles of 0 and 0~\ which we may do because 0~ x is also continuous, yields 
the converse. 

Answers to Chapter V, section 1 

V. 1.3 Let (X,d) be a metric space, A c X with A = {a,b,c}. We wish to calculate the fractal dimension of A. Let 
r = min x , yeA (d(x, y)). Then for any e < r/2, we have Af(A, e) = 3. Hence the fractal dimension is 

In 3 

D(A) = lim =0. 

6 -oln(l/e) 
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V.1.6 Using the Box Counting Theorem to cover the classical Cantor set C with boxes of size 1/3", we will get 

/ In Af(C)\ 
D = lim ( , „ . 

n^oo\ In 3" / 

We need to calculate M n (C). It takes 1 = 2° box when n = 0, subdividing and deleting the middle third gives 2 = 2 1 boxes 
when n = 1. On each subdivision and deletion of a set of middle thirds, we divide each interval covered by a box of size 
1/3" in thirds and retain two of these thirds, so that it will take 2" +1 boxes of size l/3" +1 to cover the set at the next step. 
Hence M„{C) = 2", and we have 

ln(2") nln2 In 2 

D(C = /i'm n ^ 00 -— — = lim — — - = — -. 

In 3" n->-°° n In 3 In 3 

V.1.7 Use boxes of side length 1 /2". For each quartering of the figure, and its subdivisions, we divide each box into 4 parts, 
and one part contains none of the fractal subset points. Consequently, on each subdivision, the number of boxes increases 
threefold. There is one box of side length 1/2° hence 

In 3" In 3 
D(Figure)=lim — = — . 

V.1.8 One way to choose the boxes is to take side lengths 1/2", with the first subdivision running horizontally from just 
below the bottom of the caption, the vertical running vertically from the lefthand 'tip' of the bottom piece. Do it with a ruler 
to assure yourself that this divides the figure into three congruent pieces, and leaves the fourth of the four squares empty. 
Following the lead of the previous problem, the dimension is j^f . 

V.1.9 We wish to show that using a Cantor set constructed by removing middle thirds on the interval [0, 3] has the same 
fractal dimension as the usual Cantor set. We start with 2° intervals of length 1/3 0 " 1 . The rest of the procedure is the same, 
except that we now have 

ln(2") nln2 In 2 n _ In 2 

D(Big Cantor) = lim — — — - = lim = — lim r - r~v 

v 6 ' n-oo ln(3" _1 ) n- 00 in — 1) In 3 In 3 «^<» n - 1 In 3 

V.1.10 Let A C R 2 be a compact non-empty subset. A has fractal dimension D x in the Euclidean metric, and D 2 in 
the Manhattan metric. The metrics are equivalent, consequently there is a transformation B : R 2 -> which forms the 
equivalence. By Theorem 1.2 in Chapter V then, we have D X {A) — D 2 (0(A)) or D x - D 2 . 

V.l.ll Let B = ( ^ 2 ^ 2 ), and define two IFS 

•0) + (i)-C)-'C) + (? 



with attractor A\, and 



•(;) + G)-'C)-'G)(: 



with attractor A 2 . 

These are two Sierpinski triangles, one with right angle at the origin, and hypotenuse between (1,0) and (0, 1), the other 
with right angle at (1, 1) and hypotenuse between the origin and (0, 2). We rotate one into the other, expanding by a factor 
of two, to generate an equivalence between the two systems. This requires an invertible affine map. One which works is 



•(;)-(-/ =:)(;)+(: 

The two attractors are metrically equivalent, and therefore have the same fractal dimension. 
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Answers to Chapter V, section 2 

V.2.3 Let B be given by 

B -\0 1/2 J 

Then we can use the following IFS to generate ■: 

w\ = Bx 

Each map has a contractivity factor of 1 /2, so the fractal dimension by Theorem 2.3 in Chapter V is given by 

1/2° + 1/2° + 1/2° + 1/2° = 1 

This yields 1 /2 D = 1/4 or D = 2 as desired. 

V.2.4 We get the fractal dimension of the Cantor set from the IFS 

1 1 2 

{[0, 1]; wi(x) = -x; w 2 (x) = -x + -}. 

Each map has contractivity 1/3 so we have 

1/3° + 1/3° = 1 or 1/3° = 1/2. 

Then -D In 3 = - In 2 or D = £§. to the equation 3(1/2°) + (2/5) D = 1. Solved numerically, this gives D sa 1.874. 
V.2.5 The attractor contains 3 images of itself scaled by 1/2, with two of them rotated by tx/2 and shifted. Assuming that 
the attractor is centered at the origin, and extends two units in either x -direction and unit in either y direction, the IFS is 
given by 

where B is given by 

0 -1/2 
1/2 0 

The scaling factors are all 1/2, leading to the equation 3/2° = 1, or D = |^|. 

V.2.8 This attractor has 6 maps, 3 at 1/2 scale, 3 at 1/4 scale. It is overlapping, we need an upper bound which we get from 
Theorem 3.1 in Chapter V. We have, 

3 -(1/2°) + 3- (1/4°) = 1 

Rewriting this as 

3-(l/2) 2D + 3-(l/2) D - 1=0 
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yields 1/2° = ±(V21/6) - 1/2 or 

111(721/6- 1/2) 



D 



In 2 



V.2 9 Let w ■ • R 1 -> for i = 1 , 2, be an IFS with scaling factors si,s 2 respectively. 

That the new maps are similitudes follows from the general fact that the composition of similitudes is a similitude. To see 
this write the two similitudes as 

r : R l Re l o r 2 R 2 Re 2 (x) 

Since reflections are diagonal, they commute with the other matrices, the scalars commute as well. The composition of 
rotations is a rotation, the composition of reflections is a reflection, multiplying scalars together is a scalar. We are left with 



which is a similitude. 
The new contraction factors are 



r x r 2 R\R 2 Re x R$ 2 



2 —a 2 

s V{ = s x s V2 = S\S 2 S V3 — S 2 S\S VA — s 2 



5l| D + N D = 



Using the original system in Theorem 3.1 in Chapter V, we have 
The new system yields 

\s 2 l \ D + 2\s lS2 \ D + \s 2 2 \ D = (N D + \s 2 \ D ) 2 =\ 
The calculation of dimension remains unchanged, since the first equation is a solution of the second. 

Answers to Chapter V, section 3 

The exercises in Section 3 are all experimental in nature. 



Answers to Chapter V, section 4 

V.4 1 The fact that M (A p, e) is a non-increasing function of € is proved by looking at the sums over which the infimum is 
taken. Suppose €, < € 2 and that M(A, p, €,) < M(A, p, € 2 ). Since any sequence {A,} which qualifies for ^ is a sequence 
of sets with diameters less than € 2 , it follows that by taking the same sequence in the computation of M{A, p, e 2 ), the 
infimum must be less than or equal to the sum generated by this sequence. But this is a contradiction. 
V4 2 Let A = {a b c d e f g} be a subset of (K 2 ,Euclidean). Let 2r be the minimum distance between points in A. By 
choosing € < r, we must,' for any € cover the set with at least 7 sets A„ . . . A 7 . We can also cover A exactly with 7 sets of 
any epsilon of this size. Consequently, the infimum is generated for any p by the limit of the finite sum as € - 0 which ,s 0 
for any p > 0. When p = 0 the sum is exactly 7. 

V.4 3 Let A be a bounded countably infinite subset of R 2 , under the Euclidean metric. Let p : Z + -> A be a rule of 
assignment which takes each positive integer i to an element a, e A, where p is 1-1 and onto. Such a function exists by 
definition of countability. Then the sequence of subsets A, = {a,} of A is in A, since their union is the set A. But the diameter 
of any singleton set in R 2 is zero, so that for any p > 0 

oo 

^(diam({a,})) / '=0 

1=1 

Hence for any p > 0, M($A, p, O = 0 for any e > 0, consequently, M(A, p) = 0. On the other hand, if p = 0, then for any 
non-empty subset of A, (diam(A,)K = 1. For each n > 1, there is an € > 0 such that there are more than n disjoint subsets 
of A, hence M(A, 0) = oo. 
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V.4.4 For p = 0, given e < ±, there are at least 2" subsets in any sequence of subsets of diameter less than e which cover 
C. To see this, the endpoints of 2" disjoint intervals of diameter £ are elements of C, and can not lie in the same subset 
Hence as e -* 0, M(C, 0, e) -* oo. Since 0) is the supremum of these numbers, it is also infinite. For p = 1, given 

€ > 0, we may choose the smallest n such that ± < e, and cover C with exactly 2" subsets with diameter < e, by taking the 
intersection of C with subintervals of [0, 1]. The infimum for any e > 0 is the value of the sum as n ^ oo, which is 

,. 2" 
hm — = 0 

n-i-oo 3" 

Since M(C, p, e) = 0 for any e > 0 we have M(C, 1) = 0. 

V.4.5 The limit scheme forces a minimization of the number of sets by the diameter of sets. Suppose we attempt to find 
M(A,p) such that this number is less than infinity, but non-zero (this is not always possible). For the Sierpinski triangle, we 
may proceed by using a sequence of e values that are the diameters of the subtriangles. That this is sufficient is guaranteed 
by problem 1. For any p then, the values calculated for exactly the sequence of subtriangles leads to the infimum desired. 
We know that calculating on the subtriangles, we can minimally cover A with 3" triangles of diameter d ■ 2~ n where d 
is the diameter of A. Then we have, for each n, p a value of d ■ 3" ■ 2~"p as a candidate for the infimum. Notice that as 
n -* oo for p small, this is an increasing sequence, consequently our infimum at fixed n, p is the value at n. For p large, the 
sequence converges to 0, so that we have one infimum for all e values. These p values correspond to infinite or zero values 
for M(A, p, e). The value at which we will getajinite nonzero number is when, independent of n, we have 2 np = 3". This 
value is p = log 2 3 = (In 3)/(ln 2). At this valueTthe terms in the sequence are identically d, the diameter of the Sierpinski 
triangle. 

V.4.6 As the following proof will indicate (exercise V.4.7), to get strict inequality one must have a situation where it takes 
more sets which are box shaped to cover A than if another, better shape is chosen. 
V.4.7 We have by definition, 

D(A) = lim ^ 

for the fractal dimension, where the diameter of each box is e times some fixed constant. Fixing e > 0 and rearranging this 
formula, we have 



(0- 



£>(A)ln( - ) =lnAC 
1 =M. 



:D(A) 



004) 



Since for A, chosen such that A,- = fl, n A, that is the intersection of each box with the set A, we have diam(A ( ) < B if it 
follows that for this choice of e, M(A, D(A), e) < oo, since it must be less than or equal to this choice of A,. Since D(A) 
converges, this must then be the case for every choice of e, and we have that 

oo 

lim ^(diam(A,))'' < 1 

for p = D(A). If this sum is zero, then D H (A) < D(A), if it is not, then D H (A) = D(A). Notice that if M(A, D(A)) =0, 
we cannot automatically conclude that these two are unequal, if it is non-zero then they are equal. 

V.4.10 Let A,, A 2 e (K m , Euclidean), be two metrically equivalent sets, and let 6 : A, ^ A 2 be a metric equivalence Then 
for any B c A,, we have diam(£(fl)) < C,diam(fl) and diam(fl) < C 2 diam(8(B)) for some constants C,, C 2 . Conse- 
quently, if M(A U p) = oo, then for any choice of subsets, and e > 0, we must have E, C =i(diam(fl,))^ -+ oo Thus for 
any choice of subsets on A 2 this sum for the images of B i is equal to E,=,(C 2 diam(fl,))^ which is therefore also infinite 
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Hence M(A 2 , p) = oo. Conversely, if for some p, we have M{A U p) = 0, then the infimum over € values for all choices 
of subsets of Aj is zero. Consequently, for any 8 > 0, we may find a set of subsets satisfying diam(C,) < e for which 
M(A, p,€) <8 for every epsilon by choosing a sequence of subsets whose sum is less than S/d and using the metric 
equivalence to generate the sequence in A 2 . Consequently M(A 2 , p) = 0 as well. Obviously by using 8~\ and reversing 
the role of the constants, we can do the same, given a value on A 2 . Since the two sets agree on infinite and zero values for 
M{A, p), then the point at which the two values change are the same. Hence D H {A X ) = D H (A 2 ). 

V.4.11 Proving that a metric equivalence generates the same Hausdorff-Besicovitch dimension, is identical to exercise 
V.4.10. The second statement, that the Hausdorff D -dimensional measure M(A, D H {A)) is not preserved may be shown 
by example: Looking at example V.4.8, we may take a Sierpinski triangle with vertices at (0, 0), (2, 0), and (0, 2). Then for 
each step n in the example, we cover the triangle with 3" closed disks of diameter 2^2 ■ ±, and the example proceeds as in 
the text, except that 

M(A, D H (A)) = (2V2) ln3/ln2 

Consequently we may take the original Sierpinski triangle, with the metric d 2 (x, y) = 2 • d(x, y), or twice the Euclidean 
metric, and obtain a different value for the Hausdorff p-dimensional measure. 
V.4.12 They should be (inches) 1391 . 



Answers to Chapter VI, section 1 

VI.1.2 The Taylor expansion for sin(jc) near 0 is given by 

* (_l)« JC (2«+l) 



sin 



(*> = E 



x - — x 3 + — * s + 



(2/1 + 1)! 3! 5! 



giving a linear approximation of l(x) =x. Let e > 0. Then a linear coordinate transformation from [0, e] x [0,e] to 
[0, 1] x [0, 1] is given by 0((x, y)) = (7, With the change of coordinates we still get £'(*') = x', and the graph of this 
function, L is (x, x) e [0, 1] x [0, 1]. Transforming the function y = f(x) = sin(jc), gives us 



/ = /'(*') =0o/o0 _, (jc') = 



K sin(ejc) 



€ 



so the graph G is the set (x, ^) e [0, 1] x [0, 1]. 
The Manhattan metric distance between a point g = (x, ^f^ 1 ) on G and a point / = (y, y) on L is given by 

sin(ejc) 

d(g,l) = \x-y\ + \— yl 

The shape of an € ball in Manhattan metric is a square with vertical and horizontal corners. So to get the point on L closest 
to a given point g e G, we look for the intersection of the smallest isosceles right triangle with right angle at g intersecting 
L, which since the slope of L is 1, intersects from (x,x)to (^, sJs f 1 ) at the same Manhattan distance. Hence for any 
g e G the Manhattan distance to L is 

sin(ejc) 

d(g,L) = x — 

The derivative of ™ { * x) is decreasing, and this function is always less than x in this domain, hence the distance from G to L 
is given by the maximum, reached when x — 1 , or 

sin(e) 

d(G,L) = \ 

Going the other way, we base an isosceles right triangle with the right angle at (y, y) and look for the smallest such triangle 
intersecting G. Once again, since the slope of G is decreasing, this will touch when we take the first coordinates equal, 
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hence 

sin(ey) 

d{l,G) = y — 

e 

and this is maximized at (1, 1). Hence 

sin(e) 

d(L, G) = 1 — =d(L, G) = h(L, G) 

€ 

Noting that this is indeed decreasing as e — > 0 since 

,. sin(e) 
hm = 1 

we calculate 

h(L,G) = .01 .99e = sin(e) 

or e less than approximately . 1 . 

Answers to Chapter VI, section 2 

VI.2.2 Let 

f(x) = F,-_! + X ~_ X - l (F, - F,_,) 
The first step is to show that if (x, y) e G then so is Wj(x, y). Let (x, y) e G, that is (x, y) = (x, f(x)). Then (x, y) goes to 

(r.)-G) 



/ x i~ x i-\ \ 

\ X N~ X 0 ) 
V X N~ X 0 ) 



x i x N x i-\-XQXj 
x N -x 0 

x N F i~\- x 0 F i 



+ 



x N -x 0 

To compute f(x) for the transformed x we do 



Xj — Xi — i Xf^Xj — i — XqXi 
x _| x . 

X N ~ x 0 X N — x 0 

Xi — Xi-\ XpjXj-i — XqXi Xi_\Xfi — Xi_\Xq 

= x H 

X N — x 0 X N — x 0 X N x 0 

Xj — Xj-i X\-\Xq — XfjXj 



X N — x 0 X N x 0 

(x - x 0 ) ( — — j 

\ X N x 0 / 



Dividing by x t — x,-\ yields 



X — Xfl 



X N — x 0 

and multiplying by (F, - F,-_i) and adding F,-_i yields 

(F,- — F,-_i)(x — ^o) Fi_\X N — F-i^o 



\ X N — Xq / 



, Fi-\ X N — F-Xo 

a: H = x + f 

X N — x 0 
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hence 

N 

\Jwi(G)cG 

i=i 

To show equality it suffices to show that the endpoints of the graph map to the endpoints of the z' ,h interval under u>, hence 
showing that the map is onto. We substitute x 0 , x N for x above to get 

. . • ■ , , x N x i-\ ~ x 0 x i 

diXx + e, = )x N + 



( x i ~ x i-\ 
X N — Xo 

(Xj — \ 
X N -^0 / 



X N — -*0 



ciiXo + ei — [ — — 1 ) x 0 H = Xj-i 

X N ~ x 0 



so that the union of the u>, is onto. Hence 

N 

{J Wi (G) = G 

i=i 

as desired. 

VI.2.5 For this exercise we are asked to find an IFS whose attractor is a parabola passing through the points (0, 0), (1, 1), 
and (2,4). There is only one such parabola of the form y = Ax 2 + Bx + C, which we will find, and its equation is 
f(x) = x 2 . The IFS is of the form 



U>2 



:)(;)+(;:) 
(;)=(:::)(;)+(;:) 



which we can insure passes through the interpolation points by using the equations for the various terms given in the text. 
To insure that this IFS has the parabola y — f(x) as its attractor, we substitute a point from this graph into the equation to 
yield 



so that in both cases, 



We calculate 



yielding 



( x \ _ ( a x x + e x 
Wl \f( x )J ~\c l x+dj(x) + /, 

(x \ _ ( a 2 x + e 2 
fMJ ~\c 2 x +d 2 f(x) + f 2 

(<2,x + eif = c t x + djX 2 + fi 



n — l ~° — 1 p — °-° — n 

"2 2-0 2 2 2-0 

r — 1-0 _ j 4-0 _ 1 M r _ 0-0 j 0-0 _ n 

C l — 2=0 "l2=0 _ 2 ~ Z£il ■* I 2=0 ' 2=0 

c 2 2-0 " 2 2-0 2 ^"2 J1 2-0 " 2 2-0 1 



1 , 1 , 

(-x) 2 = (--2d x )x+d x x 2 
(ijc + l) 2 = (1 - 2d 2 )x + d 2 x 2 + 1 



492 Selected Answers 



These are solved by equation of coefficients: 



4 




0 



4 



and 



-x 2 + * + ! = -*- 2d 2 x + d 2 x 2 + 1 



4 2 




This leaves c x = 0, and c 2 = 1 . We have 




VI.2.10 By exercise II.2.1 1, the Euclidean and Manhattan metrics on R 2 are equivalent, so it suffices to show that the metric 
in Theorem 2.1 in Chapter VI is equivalent to the Manhattan metric on R 2 . We do this by looking at the three possible cases: 
If 6 = 1 the two metrics, d from Theorem 2.1 in Chapter VI and d M the Manhattan metric, are identical and therefore 
equivalent. 

If 9 > 1, then since 0\y x - y 2 \ > lyi - ^1 we have d((x x , y x ), (x 2 , y 2 )) > d M ((x u y{), (x 2 , y 2 )), and since j-\x x - x 2 \ < 
\x\ — x 2 \ we have 



so they are equivalent in this case as well. 

VI.2.13 If we look at any IFS of the form {R 2 ; wi(x, y)}, then if this IFS is hyperbolic, by definition w x is a contraction 
mapping, and the attractor of the IFS is a single point. But for N = 1 , the curve we are attempting to interpolate has 
two interpolation points. Hence the attempt will always fail. Another way to look at it is that by exercise VI.2.5, these 
interpolations can always be arranged to be quadratic, a quadratic interpolation always requires at least 3 values. 
VI.2.14 By definition, we have 



where t n (x) = a n x + e n , on each subinterval [x„_i, x n ]. We need to show that for / e T, Tf is an interpolation function. It 
is shown in the text that TfeF and hence does the proper job at the endpoints. What is necessary is to show that Tf takes 
each function / to a function which agrees with / at the internal interpolation points. By definition, l~ x takes x„_i to x 0 and 
x n to x N . Because f eT, f takes x 0 to F 0 , and x N to F N , and we have 




(Tf)(x) =c n r n \x) +d n f(r n \x)) + f n 



{Tf){x n -i) = c„x Q + d n F 0 + f n 
(Tf)(x n ) = c n x N +d n F N + f n 
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into which we substitute the definitions of c n ,d n , and /„, to get 

(r/)(x„_!) = (F n x 0 - F n -\x 0 - d n F N x 0 + d n F 0 x 0 

X N ~ x 0 

= quad + d n F 0 x N - d n F 0 x 0 + x N F n -\ - x 0 F„ - d n x N F 0 + d n x 0 F N ) 

= Fn-X 

(Tf)(x n ) = (F n x N - F„^x N - d n F N x N + d n F 0 x N 

X N ~ x 0 

+ d n F N x N - d„F N x 0 + x nF„-\ — x oF„ - d„x N F 0 + d n x 0 F N ) 
= F n 

as desired. Since Tf interpolates the data for any / e J 7 if / is a fixed point of the transformation, then by definition, the 

graph of / is the attractor of the prescribed IFS, and / is an interpolation function to begin with. 

VI.2.15 We wish to construct an interpolation using quadratic scaling, in which the w n (x, y) take the form 

w„(x, y) = ((a n x + e n ), (c n x + d n y + g n y 2 + /„)) 

As before, we need to take the data pairs (x 0 , F 0 ) to (x n -i, F„_i) and (x N , F N ) to (x n , F n ). Since the transformation in x, is 
the same, we may define a n , e n as before, namely 

X n — X n _\ x n-l x N ~ x n x 0 

a n = and e„ = 

X N — x 0 X N ~ x 0 

This will insure that the first coordinates of the data pairs are mapped properly. For the second coordinates, we must satisfy 
the equations 

F„_! = c n x 0 +d n F 0 + g n Fo + f n 
F„ = c„x N +d n F N + g n F^ + f„ 

Notice that c„ and /„ must coincide with the linear case when g n is zero. This gives 

F„ — F n _\ F N — F 0 
c„ = d n g n X c 

Xn — Xo X N ~ X Q 

where we need to fill in for X g , and 

x N F n -i—x 0 F n x n F 0 — x qFn v 

fn = dn gnXf 

Xn — Xq Xn — Xo 

There are no quadratic terms in the equations to be satisfied, consequently, these must cancel at x 0 , x N , giving us 

gnFo=gnX c X 0 + g„X f 
gnF N = gnX c x N + gnXf 



Dividing out g„, and subtracting these yields 



F N — Fq = (xn — xo)X c 

Fl - F n 2 



X r = 



Xn — x o 

Putting this back into the first equation after dividing out g„, yields 
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V 1 — F 2 
Af — r 0 Xq 

Xn ~ x 0 

_ xnFq — xqFq — F^xo + FqXq 

Xn — Xq 

_ *nFq — xpFjj 

X N — Xq 



so that we have 

_ ^" ~ ^"~ X — rt ^ N ~ ^° — ^ ~ 
C n — (*n Sn 

Xft — Xq Xn — Xo Xn ~ Xq 

_ XnF„-\ — XqF„ X^/Fq — XqF n XnFq — XqF^ 

Jn = "/i Sn 

X N ~~ x 0 X N ~ x 0 X N ~ x 0 

to complete the coefficients for the quadratic case. 
VI.2.16 Consider the integral 



= / f(x)dx 

Jxn 



I 

IXQ 

which here is well defined, since / (x) is continuous and of bounded variation. We assume that / is the fixed point of the 
fractal interpolation transformation, i.e. Tf = /, where {F, d) is the metric space of functions which match the endpoint 
data, and are continuous. Then we have 

N 



I = / (Tf)(x)dx = J2 / (Tf)(x)dx 

J*Q o=l * x n-\ 



where 



A C XN P 
= 2^ (c n x + d n f(x) + f n )d(a n x + e n ) = al + p = —— 

o=l J*0 01 
N ~ N sx N 

ce = 22 a " dn and P = /2, a "\ ( c " x + f") dx 

o=l 0=1 J x 0 

We need to show that under the standard assumptions, |a| < 1, we have 

N N 

l«l = \ ^a n d n \ < ^ \a n d n \ 

0 = 1 0=1 

and 

N 

y^q„ = 1 and \a n \ < 1 

0=1 

The x„ are ordered, consequently a n x 0 + e n < a n x N + e n which implies a n > 0, hence \a n d„ \ = a n \d„\, and therefore 

N 

l«l < ^ a n \d„ 1 

0 = 1 

which implies |a| < 1. To show that 

f x N 

/ f 0 (x)dx=P 

JXQ 

we merely note that p is independent of d n , and the result follows from above. 
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VI.2.17 We want the integral 

r x N 

I] — I xf(x)dx 

under the same assumptions as in exercise VI. 2. 16. Once again since Tf = /, we have 

/, = / x(Tf)(x)dx = 2_J / x(Tf)(x)dx 

Jx 0 „ = 1 Jx n -\ 

so we transform variables. 

(Tf)(a n x + e„) = c n x + d n f{x) + f n 
and we replace x with to write the same integral as 



22 I —^-(CnX +d n f(x) + f n )d(a n x +e n ) 

n = l ' X Q 



a n 

N 



~M a~ (Cn 

„=i ^ x n 

The first integral can be written as 



— (c n x +d n f(x) + f n )d{a n x + e n ) 
a n 



x + d n f(x) + f n )d(a n x + e„) 



f x N x C X N d n 

/ — (c n x + f n )d(a n x + e„) + / — xf{x)d{a n x + e„) 

c x n r x N 
= / (c n x 2 + f n x)dx+d n I xf(x)dx 

Jx 0 0 

r x N 

= / (c n x 2 + f n x)dx + d n l x 

Jx 0 



The other half looks like 



r x N r x N 
e n I (c n x + f n )dx + a n e n I f(x)dx 

Jx 0 o 

which using the notation 7 0 for the integral I in exercise VI.2.16 yields 

r x N 

^ e n (c n x + f„)dx + d n e n I 0 

1 J x o 

We now let 
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_W -x N 

y = 22 ( c " x2 + f"X)dx 

& = 2_, e n / (C„X + f n )dx 

N 

n = \ 
N 

n = l 

Then we have 

h = y +6/1 - (0/o + 5) 
= Y ~ (0/o + S 

1 -6 

= y — 8 — (0)6/(1 — a) 

1 - 6 

where we have used the notation of VI. 2. 16 for the integral Io. We therefore must restrict the d n such that 6^1. Once again, 
y, 8 are independent of d n , and we arrive at the integral = y — 8 in the case that d n — 0 for all n. 

Answers to Chapter VI, section 3 

The exercises in Section 3 are all experimental in nature. 

Answers to Chapter VI, section 4 

VI.4.1 Define the metric on I x Y by 

d((x u y x ), (x 2 , yi)) = |*i - x 2 \ +0d Y {y x , y 2 ) 

where a = max{a, : / = 1, . . . , N}. 

d Y (M n (a, y), M n (b, y)) < c\a - b\ 
d Y (M n (x, a), M n (b, y)) < sd Y (a, b) 

and 9 = ~. c, s > 0, and under the usual assumptions about the ordering of a:, , 1 > a > 0. We wish to show that under 
these assumptions, the IFS is hyperbolic with respect to the metric d. Under each map, (x, y) h» (L„(x), M n (x, y)) so that 

d(w n (x u y x ), w n (x 2 , y 2 )) -d{{L n {x x ), M n {x x , y x )), (L n (x 2 ), M n (x 2 , y 2 ))) 

= \L n (x\) - L n (x 2 )\ +0dy(M n (x u yi), M n (x 2 , y 2 )) 
= \a n x x - a n x 2 \ + 0d Y (M n (x x , y x ), M n (x 2 , y 2 )) 
= |a„||jci - x 2 \ +0d Y (M n (x x , y^, M n (x 2 , y 2 )) 
<a\x x - x 2 \ +0d Y (M n (xx, yi), M n (x 2 , y 2 )) 

The second half of this expression can be decomposed using the triangle inequality to read 

d Y (M„(xx, yx), M n (x 2 , y 2 )) 

< d Y (M n (xx, yi), M n (x 2 , yx)) +d Y (M n (x 2 , y x ), M n (x 2 , y 2 )) 

< c\xx - x 2 \ +sd Y (yx, yi) 



Selected Answers 497 



Putting this together with the statements above, we have 

d(w n (x u yi), w n (x 2 ,y 2 )) <a|*i -*2l + 0c\x { - x 2 \ + s9d Y (y x ,y 2 ) 
2a + 1 — a 



2 

1 +a, 



-x 2 \ + s6d Y (y u y 2 ) 



xi - x 2 \ + sOd Y (y\, y 2 ) 



Since 0 < a < 1, we have 1 + a < 2 so let s' = max(^, s) < 1 then 

d(w n (xi, yd, w n (x 2 , y 2 )) < s'\x x - x 2 \ + s'9d Y (yi, y 2 ) = s'd((x lt yi), (x 2 , y 2 )) 
so the IFS is hyperbolic as desired. 

VI.4.2 By Theorem 4.1 in Chapter VI, the attractor exists. We devise the operator T : T -+ T as in Theorem 2.2 in Chapter 
VI to be 

(Tf)(x) = M n {f{L~\x))) x e [*„_!,*,,] 

where T is the set of continuous functions / : [x 0 , x N ] Y. L n is a n x + e n which is continuous with continuous inverse, / 
is assumed continuous, and M„ is again linear, so that Tf is a continuous function on each of the [x„-\ , x„]. We must assure 
that T takes T into itself. We have 

X\ — Xo X/yXo — X(,Xi 

Li(xo) = a\x 0 + e\ — x 0 H = *o 

X N ~ x 0 X N — -^0 

Xn — Xw-l . x N x N-l — X N X 0 

L N (x N ) — a N x N +e N = x N H = x N 

Xn — Xq Xn — Xo 

X n — X n -\ XnX„^\ — XoX„_i 

L n ( x n-l) = «n^n-l + = x n-\ H = x n-\ 

X N — x 0 X N ~ x 0 

X n — X n -\ X/yX n — \ — X n XQ 

L n {x n ) — a n x n + e n — x n H = x n 

X N ~~ x 0 X N ~ X Q> 

so the endpoints of each segment conincide with the next and last, and x 0 , x N map to themselves. Hence these points are 
fixed under L n and therefore under L~ l . By definition, 

(Tf)(x 0 ) = M n (x 0 , F 0 ) = (jc„_,, 
(Tf)(x N ) = M n {x N , F N ) = (x n , F n ) 

hence T : T ->• T as desired. We now show that T is a contraction mapping. Let f,g eT. Then we have 

{Tf)(x) = M n {L-\f{L-\x))) 

{Tg){x) = M n {L-\g(L-\x))) 
With the given definition of a metric, and the restrictions imposed on M„ we have 

d(Tf, Tg) = max d Y (M n (L;\x), f(L~\x))), M n (L~\x), g(L n \x)))) 

xe[x n _\,x n ] 

< max sd Y (f(L-\x)),g(L-\x))) 

xe[x n _i,x n ] 

< d(f, g) 

as desired. 

VI.4.8 We want to show we can generate polynomial graphs with affine hidden variable interpolation. Using the proof given 
in exercise VI.4.2, we have (Tf)(x) = M n {L~ l {x), f{L~\x))). Consequently, the attractor of the IFS is given by 

f(x) = M n (L- l (x),f(L;\x))) 
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where we let f(x) = (y(x), z(x)). We are interested only in fitting the first component, the second component may be 
chosen for ease of manipulation of the equations. We use the notation of the book, namely 

L n {x) = a n x + e n 

' x ' 



From this we get 



\l n m n J 

(y {x) \ = A (c n (L- l (x)) + f n \ 

\z(x) J " \z(L; l (x))J \k n (L- l (x)) + gJ 



from which we extract the equation for y(x) to get 

y(x) = d n y{L n \x)) + h n z{L-\x)) + f n + c n L~\x) 

Newton proved that it requires N + 1 interpolation points to generate an N th degree polynomial, so that we assume that we 
have been given these points, and that they fit y(x), and likewise assume 

y(x) = a N x N + a N -iX N ~ l H h a 0 

Since L n is linear, so is L~ l , so the degree of y(L~ l (x)) is also N, and we let (for notational convenience) 

y{L-\x)) = p Nn x N + p N - ln x N ~ l + ■ ■ ■ Po n 



Then we have 



which gives 



a N x N + • • • + ao = d n y(L- l (x))+h n x(L;\x)) + f n + c n L; l (x) 

= d n p Nn x N + ■■■ d n fo n + h n z{L-\x)) + f n + c n L~\x) 



(a N - d n p Nn )x N + ■ ■ ■ (a 0 - d n p 0n ) = h n z{L-\x)) + f n + c n L~ x {x). 

Now d„, as in the non-hidden variable case, is independent of the fitting of the points (x { , F,), hence on each subinterval, we 
may choose 

d n = 

Then the left hand side is a polynomial of degree N — 1 so we may choose h n , and let 

z(x) = y N -\x N ~ l H h y 0 

in such a way so that if 

^_,„x A '- 1 + --- + 5o n =z(L; 1 W) 

then we have 

h n 8 in - a, - d„p in 

for i > 2 and 

h n 8 ln x + h n 8 0n + c n L~\x)) + f„ = (a, - d n fi ln )x + (a 0 - d n fo n ) 

by equation of coefficients. Thus we may generate a hidden variable fractal interpolation function for a polynomial of 
degree N. 
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Answers to Chapter VI, section 5 

VI.5.1 From Theorem 5.1 in Chapter VI, using three maps, we may choose 



*\ /\ 0 0 \ /x\ / 0 



wi ( y = 0 a, b x )\y + \e x 

,zl \0 c, d,/ W V/. 

' X \ ° / 2 

«>2 1 y ) = I 0 «2 ^2 ) I y I + I <? 2 

z! Vo c 2 j 2 7 W \/ 2 

x\ /\ 0 0\/x\ / § 



^ ( y = p a 3 fc 3 y + [ei 

.z! V0 C 3 d 3 / \zJ \h 

For the Sierpinski triangle, the appropriate values are given by a, =d t = \, bi = c, ■= 0, and ei = f\ — 0, e 2 = 5, f 2 = 0 and 
e 3 = 0,/ 3 = 5. 

Answers to Chapter VII, section 1 

VII.1.1 Let {1R 2 ,/} be given by 

'(2x,2y-l) ify>l/2, 
f(x,y) =■ (2jc - l,2y) if jc > 1/2, 
(2x, 2y) otherwise. 

If (x, y) > 0 and (x, y) $ S, then eventually y > 1/2 since y is doubled if x > 1/2 or if neither x, y > 1/2. Once this 
happens we have 

f"(x,y) = (2 n x,2"y -2 n ~ l ) 

Given y > 1/2, we have 

lim 2 n y - 2"- 1 = lim (2y - 1)2"-' = oo 

n—KX> n—KX 

and x oo , as well. Hence these points escape to infinity. If y < 0, x > 0, then eventually 2"x > 1/2, if x < 1/2. Once this 
happens we have 

lim 2"x - 2"" 1 = lim (2x - 1)2""' = oo 

n— *oo 



for any point whose x coordinate is larger than 1/2. The y coordinate remains negative, and 2"y -> -oo. Hence these points 
also escape, in this case in the direction of (oo, -oo). For points y < 0, x < 0, each coordinate remains negative indefinitely 
and these points diverge as 2"x -oo and 2 n y -oo, so these points escape to (-oo, -oo). 

VII.1.3 The condition x 2 + y 2 > R determines the color based on the iteration number. This is passage past a circle of 
radius ^R. Since if f(x, y) = (2x, 2y) we have /"(*, y) = (2"x, 2 n y), and 

(2"x) 2 + (2">0 2 = 2 2 "(x 2 + y 2 ) > R x 2 + y 2 > ^ 

we have concentric circles of radius ^ . 
VII.1.6 These two functions, 

wdz) = re i0 z-l w 2 (z)=re w z+ 1 
take a disk of radius R, to a disk of radius rR, rotate it by 0, and move its center to either ±1. Consequently, if we can find 
a disk which is mapped i»to itself by both maps, we know that the attractor of the IFS is interior to the disk (see exercise 
VII. 1.8). We thus look for R as a function of r such that this is the case, and restrict it such that its two images do not 
intersect. This will force the attractor of the IFS to be totally disconnected. 
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We make one further simplification: If the rotation were 0, we could examine the transformation on the real line, since 
these transformations take circles to circles, this will correctly predict the values of R. This reduces the problem to looking 
at w, (x) = —rx ± 1. We look for a cross section of the ball [a, b] such that the images 

Wl ([a, b]) = [-rb -l,-ra-\] w 2 ([a, b]) = [-rb + 1, -ra + 1] 

both lie within [a,b]. These two intervals are of identical length and orientation, differing only by a shift of two along the 
real line. Consequently, our conditions are 

a < -rb - 1 b > -ra + 1 

Solving at equality yields 

a — —rb — 1 
b = -ra + 1 
a = r 2 a — r — 1 
(1 - r 2 )a = -r - 1 

1 +r 1 



l-r 2 l-r 

Symmetry now dictates that b = , so that a disk of radius R = l/( 1 — r) maps inside itself under each map of the original 
IFS. We now want the condition on r such that these images of the disk B R {0) do not intersect. This happens when a disk 
of radius rR, centered about 1 does not intersect the y-axis, namely that 

1 

< 1 



1 - r 

r < 1 — r 
2r < 1 
r < 1/2 

Consequently, for the original maps, as long as r < 1/2, the maps 

w x {A) n w 2 (A) c w x {B R (0) n w 2 (B R (0) = 0 

and the IFS is totally disconnected. We must now define f(z) such that on the attractor, we do this with three maps (letting 

z = z\ + izi), 



f(z) = 



w x if Zi < — t^7 



l-r 



W 2 if Z\ > 

re i6 z + k k = otherwise 



VTI.1.7 This is essentially the general case of the preceding problem. Let {A; /} be the associated shift dynamical system 
of a totally disconnected hyperbolic IFS on R 2 , where A is the attractor of the IFS. Then we must have 

f(a) — w~ l (a) for a €Wi(A). 

Since the IFS is totally disconnected, we have 

Wi(A)nwj(A) = 0 ifi#y 

Furthermore, since the IFS is hyperbolic, Wj(A) is compact for each i. Consequently, in [R 2 this means d(w, (A), Wj(A)) > 0 
if i ^ j. Then we may extend {A; /} using any function g(x) which smoothly connects the maps in the intervening 
distance. For example, since there are finitely many maps, we may define 

e = mind(Wi(A), w:(A)) > 0. 
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Now construct functions which are equal to iu f 1 on w, (A), and decrease to zero at a distance less than e/2 from these sets, 
by taking functions 

. i(x) = ((l--^)«, r »(x) if^,(A))<l 
1 0 otherwise 

with k > 2. Finally, sum these together to get 

N 

gk(x) = Y^Wi(x) 
(=1 

Because the distance e was chosen to be less than the distance between images of the attractor under w if these functions are 
continuous, agree with / on the attractor, and since k can be taken at any value greater than 2, there are infinitely many of 
them. 

VII.1.9 We get the iteration formula by generating the real and imaginary parts of /(z) = z 3 - k. Letting z = x + iy, we 
have 

z 3 = ( X + iy) 3 = x 3 + 3x 2 iy + 3xi 2 y 2 + i 3 y 3 
= x 3 + i3x 2 y - 3xy 2 - iy 3 
= (x 3 — 3xy 2 ) + i(3x 2 y — y 3 ) 

so that the new formulas, letting k = k { + ik 2 are 
newx = x~3 - 3*x*y~2 - lambdal 
newy = 3*x~2*y - y"3 - lambda2 

The choice of R can be derived as follows: Writing z = re w , we have z 3 = r 3 e i39 , which has a distance from the origin of r 3 
compared to the distance r of z. Subtracting k will have maximal effect on this distance when z 3 and k lie on the same ray 
from the origin, namely when k = pe'* and $ = 36. Consequently the problem reduces to this maximal case, namely that 

r 3 - \k\ >r 

or 

r 3 - r - \k\ >0 

The minimum value for | A. | is zero, and the inequality acheives equality for this k value when r = 1 + | A. | . Choosing this as 
a candidate value for R, we find that 

(1 + |A|) 3 - (1 + \k\) - \k\ = |A.| 3 + 3|X| 2 + 3\k\ + 1 - 1 - 2\k\ 
= \k\ 3 + 3\k\ 2 + | A. | >0 

so that for any k e C, choosing R > \k\ + 1 will work in the escape time algorithm. 

Answers to Chapter VII, section 2 

VII.2.1 In this case X = C, we must show the presence of sets such that W(A) = A, where W(A) = U w 2 (A). 

Choose A = {O}, then w x (A) = w 2 (A) = O. Likewise, choose A = {00} , then wdA) = w 2 (A) = {00}. Consequently, the 
unrestricted IFS in this case does not contain a unique invariant set. In point of fact, neither of these sets above are attractors 
of the IFS. The system is not contractive around these points, which are repellors. The point z = 1 is also a fixed point of 
the system, it is a repellor, and is part of /(/). 

VII.2.2 Let k e (3/4, 5 1 /4) and f(z) = z 2 -k. We want to look at solutions to 

z 2 + z + (l -A.) = 0. 
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Using the quadratic equation we obtain 

-1 ±V1 -4 + 4A. -1 ± V4A-3 



z 

We let 



■1 + V4A-3 



-1 - V4A-3 

Z2= 

The easiest way to examine cycles of period 2 here is to check to see whether f(z\) = Zi and f(z 2 ) = Z\ (different ap- 
proaches are used for different problems, this avoids quardic equations here). We have 



2 1 V4A -3 4A-3 t 1 V4T^3 
<> = 4-— 2- + -T- = ^-2 } -— 2— 



2 1 V4A-3 4A-3 / 1 V4A-3 

^ = 4 + + — = (A--) + - y - 

Subtracting k from each of these yields the desired values f{z\) = Zi, and f(z 2 ) = Zi, hence this is a cycle of period 2. 
To check whether the cycle is attractive, we have 

f{z) = 4z 3 - 4k z 
We can simplify this for each z, by making use of the period 2 properties: 

z] = (Z2 + *)Zl = Z\Z2 + Xzi 
zl = (Z\ + X)z 2 = Z1Z2 + ^Z2 

giving us 

f 2 '(zO = f 2 '(z 2 ) = 4z l Z2 



and 



1 - (4k - 3) 4 - 4k 

4 Zl z 2 - 4 — = — - — = 1 - k 

4 4 

For k 6 (3/4, 5/4) we have (1 — A.) 6 (—1/4, 1 /4) so that 

l/ 2 'fe)l< \ < 1 
4 

Hence this is an attractive cycle of period 2. 

VII.2.3 Let k be specified and suppose a ■ i e J x . Then -a 2 e J x and a 4 e 7 A are real line points. The set J x is connected, 
therefore there is an extreme point of this form which maps to the left most extremity of J x , on both sides of the real axis. 
The pair of IFS cut J k at the negative extremity and map this image to the left and right side, depending on the definition of 
the square root in the two IFS equations. 

VII.2.5 For the dynamical system {C; f(z) = z 2 - k : k > 2}, we wish to show that the orbit of the critical point {O} 
converges t o the Poi nt at Infinity. We know that the basin of attraction for oo contains at least the points for which 
r > 1/2 + Vl/4 + k so that if k > 2, we have r > 1/2 + 3/2 = 2. f 2 {0) = f(-k) which is 2 when k = 2 and increases for 
increasing k. Hence lim^^ f(0) = oo. By Theorem 2.1 in Chapter VII, the IFS {X = C \ B(oo, e); +VFTX, -y/z~Tk} 
possesses a unique attractor. 
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Answers to Chapter VII, section 3 

VII.3.1 The Newton's transformation associated with F(z) is given by 

F(z) _ z 4 - 1 

To see that 1 is indeed fixed, we compute 

/(1) = 1-0=1 

Taking the derivative with respect to z evaluated at 1, we have 

(4z 3 ) 2 - 12z 2 (z 4 -l) _ _ 16z 6 -12z 6 +12z 2 
/'(z) = 1 (^5)2 16z 6 

12z 6 - 12z 2 _ 3 _ _3_ 
16z 6 _ 4 4z 4 

evaluated at 1 we have /'(z) = 3/4 -3/4 = 0 so that 1 is an attractive (sometimes called super-attractive because /'(z) = 0) 
fixed point of the Newton's transformation. 

VII.3.2 The Newton's transformation associated with F(z) = z 2 + 1 is 

z 2 + i K , 1 
/(Z) = Z ___= Z --( Z + I ) 

= U-^ = \/2(z-- 
2 2z z 

so that the corresponding IFS is got by taking inverses of /(z) to yield 

1 

w = \/2(z- -) 

z 

2u>z = z 2 — 1 
z 2 - 2u;z - 1 = 0 

2w ± s/4w 2 + 4 , r-TT7 
4 2 

Consequently, the appropriate IFS is 

Wl (z) = z + Jz 2 +1 and u; 2 (z) = z - Vz 2 + 1 

For any z e R. z 2 + 1 > 0, hence (z) e R. For z = we have ..(z) - oo «id «*(z) - J ««^ly * U M>> 
R U {ci}, and the map is onto, by virtue of the behavior of w 2 (z) for large z. Then if A = R U {oo}, W(A) - A. We must 
make sure we can exclude the fixed points of the transformation. These are given by 

1 1 

2 (z -7 ) = z 

or Zf = ±L The derivative of /(z) at these points is 1/2(1 + l/(z 2 )) = 0 so these are attractive. Hence removal of a small 
,-ball around ±/ results in a space XcCon which A is a unique invariant set. Therefore A is the attractor of the IFS. 
VII.3.5 The Newton's transformation associated with F(z) = e z - 1 is given by 

f(z) = z-^- = (z-D + e- z 

It has two fixed points in C, for we have 

z = (z-D + e- z ^e- z = l^z = 0 
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indicating that 0 is a fixed point. We also have 

lim (z - 1) + e~ z = oo 

Z-+0O 

so that oo is also fixed. The derivative of f(z) is given by 

f'(z) = l-e-* 

which is zero at z = 0, and approaches 1 at infinity. Hence the attractive fixed point is 0, oo is a neutral fixed point. 
VII.3.7 We wish to find a Mobius transformation h(z) which establishes a conjugacy between f(z) and g(z). Thxs means 
finding h(z) such that 

f o h(z) = h o g(z) 

where 

f(z) = \(z-\) g(z) = z 2 Hz) = °^ d 
We mention at the outset that such a transformation is indeterminate with respect to scaling factors, so there will be choices 
among a, b, c, and d. Writing out the two terms of the conjugacy we have 

, x az 2 + b 
h o g(z) = 2 , , 
cz + " 

and 

l/az+b cz+d\ 1 ( (az+b) 2 -(cz+d) 2 \ 
fohiz) =2W+d az+b)~2\ (cz+d)(az+b) J 
_ 1 / ah 2 + 2abz + b 2 - c 2 z 2 - led - d 2 \ 
~2\ acz 2 + bcz + adz + bd ) 
_ I ( (a 2 - c 2 )z 2 + 2(ab - cd)z + (b 2 - d 2 ) \ 
= 2 \ acz 2 + (ad + bc)z + bd / 

so that 

W + 2fr (a 2 - c 2 )z 2 + 2(ab - cd)z + (b 2 - d 2 ) 
cz + d ~ acz 2 + (ad + bc)z + bd 

We may eliminate the linear terms in z by setting 

ad + be = 0 ab - cd = 0 



which give us 

a = -^^-—-cd = 0^d 2 = -b 2 
a d d 

d = - b S^ab + — =0^a 2 = -c 2 
a a 

With these equalities assumed, we have 

2az 2 + 2b _ (a 2 -c 2 )z 2 + (b 2 + d 2 ) 
cz 2 + d ~ acz 2 + bd 
Since, as mentioned, we have choices of scale, we set a = 1. This gives us 

2z 2 + 2 (l-c 2 )z 2 + (b 2 + d 2 ) 
cz 2 + d~ cz 2 + bd 
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and comparing denominators leads to b = 1 . Then we have 

2=1 -c 2 and 2=1 — d 2 
c = ±i and d = ±i 

and we again have a choice (this time the sign of the scale factor). We choose c = i. Then ab - cd = 0 or 1 - i d = 0, and 
d = —i. Consequently we have 

z + 1 

h(z) 

Checking, we have 



iz - 1 



1 / z + 1 iz - i \ 1 
f(h(z)) = -(-. :-— =x 

2 \«z — i z + 1 / 2 



1 z 2 + 2z + 1 - (-z 2 + 2z- 1) 

iz 2 - iz + iz - i 
1 2z 2 + 2 z 2 + 1 



2 iz 2 - i iz 2 - i 



Ms(z)) 



as desired. 



Answers to Chapter VII, section 4 

VII.4.1 We analyze this map by first noting that this map is equivalent to the map 

z 2 - 1 for zi > 0 



f(z) l z 2 -l+kzi forzi<0 

where zi is the real part of z = Zi + iz 2 - Consequently, for A. = 0, and for any open set lying entirely in the right half plane, 
the mapping is open. For A. e [- 1 , 0], a look at a small e-ball about the origin shows why the map is still continuous, namely, 
the action of the term kx is to shift the points within the ball over towards the right. This produces a closed oval, with points 
to the interior mapping to the interior and the boundary mapping to the boundary. Consequently an interior point is still 
surrounded by an open ball of the image of interior points, and the map remains open. Specifically, given (x, y), if we have 
a point y) to the left of (*, y), then since near the origin, x 2 x, the kx term dominates, and the disk is reversed, 
f(x u y) is to the right of f(x, y), and points with the same x value map to the same x value, creating an oval around the 
center (x, y). Then an open set maps to an open set. 

The mapping is not open for k = 1 however. To see this, choose first e 0 < 1/2, so that for points in an e 0 -ball about the 
origin, 2x 2 < x. Then for points on the boundary of the left half ball x 2 + y 2 = e 2 and the intersection with the y-axis, 
the portion on the axis maps to an interval on the real axis [-e 2 - 1, -1]. The rest of the boundary meets this interval at 
-to - 1 , but lies entirely to the left of it: 

* 2 + y 2 = 4 

f( x , y ) = ( x i-y 2 -\ + x,2xy) 

= x 2 - (€o~x 2 ) - 1 +x,2xy) 
= (-e 2 - \) + 2x 2 + x,2xy) 

Since we have chosen e 0 such that the x term dominates the 2.x 2 term this indeed lies to the left. If we look at this figure 
from e = e 0 as e 0, then the envelope of the images of the boundary form the boundary of the image of the ball B (Q (0). 
In particular, there are points interior to this ball which map to the boundary of the image. For any such point, a smaller ball 
around that point folds interior to this envelope, exposing it, hence there is no open set around such a point which maps to 
an open set, and the map it not open. 

Finally, by choosing a ball around the origin sufficiently large, for any fixed k, the x 2 and y 2 terms will dominate the kx 
term so that f(B R (G) D B R (0). In fact, as (*, y) oo, the map approaches the analytic map /(z) = z 2 - 1. 
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Answers to Chapter VIII, section 1 

VIII.1.3 We must first show that we satisfy the Lipshitz continuity requirement of Theorem 11.1 in Chapter III, namely for 
each wi there is a k,, independent of p, jc, such that 

d(w ip (x), w iq (x)) < ki ■ d P (p, q) 

For w\{x) — k x x, we have 

d{k Xp x, k lq x) = \k lp x -k lq x\ = \x\\k lp -k lq \ = \x\d P (p,q) 

which is not independent of x. Similarly, for w 2 (x) = k 2 x + 1 - k 2 , we have 

d(k 2p x + 1 - k 2p , k 2q x + \-k 2q ) = \k 2p x + \-k 2p - k 2q x - 1 + k 2q I = \x - 1 \d P (p, q) 

also not independent of jc. We must restrict the domain as in exercise VIII. 1.2. We have — .9 < k x , k 2 < .9, and we look for 
an interval [a, b] such that W([a, b]) c [a, b] for all p e P. 

For k 2 > 0, [0, 1] contains all the attractors. To see this, we have 

u>,([0, 1]) = [0,A,] w 2 ([0, l]) = [k 2 , 1] w 3 ([0, 1]) = [1/2, 1] 

When Ai < 0, we have iui([0, 1]) = [k u 0] <£ [0, 1], and likewise w 2 ([0, 1]) = [1, 1 - k 2 ] <£ [0, 1]. One pushes the value for 
the necessary lower bound for the interval down, the other pushes the upper bound up. These values reach their minimum 
and maximum when k { — k 2 = — .9. Consequently, we will have found the proper interval [a, b] when 

W ( _. 9 ,_. 9) ([fl,fc])c[a,fc]. 

Writing this out explicitly, we have 

w x ([a,b]) = [-.9b, -.9a] 

w 2 ([a, b]) = [-.9b + 1.9, -.9a + 1.9] 

w 3 ([a,b]) = [.5a + .5, .5b + .5] 

which means that a, b must satisfy 

a < mm(-.9b, -.9b + 1.9, .5a + .5) 
b > max(-.9a, -.9a + 1.9, .5fc + .5) 
Now in order to include the interval [0, 1] needed by the positive values for k { , k 2 , a < 0, and b>\, which means that 

min(-.9fc, -.9b + 1.9, .5a + .5) = -.9b 
max(-.9a, -.9a + 1.9, .5fc + .5) = -.9a + 1.9 
Setting the inequalities above to equality to solve we have 

a = -.9b b = -.9a + \.9 b = .\Sb+ 1.9 

which yields 

b= 4|^2.3 a^-2.1 

so that, for convenience, the interval [—3, 3] contains all the attractors, and we may prove the continuity on the IFS 

{[-3, 3]; w u w 2 , w 3 } 

to get it for all of K. Looking back to our original jc dependencies we now see that by substituting the endpoints of this 
interval for jc in the distance expressions, we have the following Lipshitz conditions: 

d(w ip (x), w lq (x)) < 3 • d(p, q) 

d(w 2p (x), w 2q {x)) <4- d(p,q) 

Consequently, the attractor varies continuously with k e P. 
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Answers to Chapter VIII, section 2 

VIII.2.2 We first prove the hint. Let N be the set of all points B e H(X) such that B is disconnected. Suppose B e N. Then 
by definition, there are sets B { , B 2 C B, such that 

B x U B 2 = B; B x n B 2 = 0. 

Furthermore, since these sets are closed in B, they are compact. Then if 8 — h{B x , B 2 ), 8 > 0. Suppose B' e H(X) such that 
h(B, B') < 8/3. Then for any x e B', we have d(x, B) < 8/3, and for any y e B, we have d(y, B') < 8/3. Let 

B\ = {be B' :d(b, B x ) < 8/3} 

B' 2 = {beB' : d(b, B 2 ) < 8/3} 

Since each point in B' must be close to a point in B, we must have 

B[ U B' 2 = B'. 

Now suppose that x e B[ n B' 2 ^ 0. Then 

2,8 

d(x, B x ) + d(x, B 2 )< — <8 

and by the triangle inequality, we would have, for some b\ e B\ and b 2 e B 2 , 

d(b u b 2 ) < d(x, b x ) + d(x, b 2 ) < 8 

which is a contradiction. Hence 

B[ n B' 2 = 0 

and B' is also disconnected. But B' was any set within e = 8/3 of fl, so there is an open ball B ( (B) c A^, and N is open in 
W(X). 

Now let ^> be a parameter value such that A(p) e A^, that is, A{p) is disconnected. Then there is an e > 0 such that 
B e (A(p)) c A^. This ball is open, and by the assumptions of the problem, the map A : P — ► H(X) is continuous, so 
A~ l (B e (A(p))) is an open set in P containing p such that every element corresponds to a disconnected attractor. Hence 
the set of points in P for which A(p) is disconnected is open in P. The complement is Ai, which must therefore be closed. 
VIII.2.4 For any value of X in the parameter space, the attractor A can be written as the set of points of the form 

oo 

Since if z € A, then the point 

oo 
k=0 

is also in A, we have 

A = -A = {z : -z e A} 

(in other words the attractor of an element of this family of IFS is itself radially symmetric). Defining 

XA = {kz:ze A} 

to make the notation simple, we have 

A = (kA+ 1)U(AA- 1). 

And ^ 

— AA + 1 = A(— A) + 1 = AA + 1 
— AA - 1 = A(-A) — 1 = AA — 1 
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in other words, A (A.) = A(-k). It then follows that if the attractor of k is connected, so is the attractor of -k since they are 
identical, so M is symmetric about the origin. 
VIII.2.5 We have the IFS 

{€;k z + l,kz- 1} 
where k = (0, 1 /y/2). We may rewrite this as the IFS 

{t 2 ; wi(x,y),w 2 (x,y)} 

where 

Suppose we look at the action of these two maps on a rectangle centered at the origin. Each map flips the rectangle along 
the corners, reduces each side by 1 /y/2, and then translates it either one unit to the left or to the right. We may solve for the 
position of a corner of such a rectangle, symmetric about the origin, such that it went to half the original rectangle. To do 
this, we must have (in absolute value) 

x = \/y/2{y) + 1 
y = l/V2(x) 

which has the solution x = 2, and y — y/2. 

Consequently, if we have the rectangle with corners (±2, ±y/2), Then each map takes this rectangle to one of the 
horizontal halves of the original rectangle, just-touching along x = 0. The attractor is thus the rectangle, which tiles the 
plane. 

VIII.2.7 Suppose that w x {A) n w 2 {A) = {z 0 }, a single point. We wish to argue that if / is a path which intersects itself, 
then / is a point. Certainly such a path which is not trivial must be contained in either Wi(A) or w 2 (A) or, since it must pass 
through zo this is a point of intersection. Then we may argue the same about the points iui(zo) and w 2 (zo), so these are points 
of self intersection. By induction, such a non-trivial non-self-intersecting path can not exist. 
VIII.2.8 Let e = e x e 2 ■ ■ ■ e n ■ ■ ■ be a point in £ a code space on the symbols {-1, 1}. For k e C, define 

f(k) = e l +e 2 k + e 3 k 2 + --- 

We want the radius of convergence of f(k). f(k) is bounded above by 1 + k + k* + = (corresponding to e = 
ll). It is bounded below by -1 - k - k 2 ■ ■ ■ — - £ k n , corresponding to e = -1-1. These two series have a radius of 
convergence of |A.| < 1 and diverge at the endpoints. Consequently f(k) has a radius of convergence of 1. We show that the 
attractor for each k is the set of points 

±1 ±k±k 2 ± ... 

and that the map f(k)(e) =<f>(e). The attractor is the limit of the sequence in H{€) given by {W({0})}. This sequence 
yields 

0h+±lh+±A±lh+±A 2 ±A±lh+--- 

By induction, A (k) is the set of points 

±l±k±k 2 --- 



which is the image of £ under both f(k) and 4> : £ A(k). 
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Answers to Chapter VIII, section 3 

VIII. 3.3 The attractive orbit here is not a cycle, but rather aperiodic. There are other bubbles with periods equal to powers of 
2 multiplied by the various odd numbers out past this point however. Try experimenting with the numbers around X — 1.75 
for instance. 

Answers to Chapter VIII, section 4 

All exercises in this section are programming experiments. 

Answers to Chapter IX, section 1 

IX. 1.3 pi ■ p 3 = 0.03 since the probabilities are independent. 

IX.1.4 The attractor is a Sierpinski triangle with vertices at 0, /, 1. The ball B 4 is located at 2 + / and has radius «J2 and 
therefore intersects the attractor only at the point 1 . The probability of getting only one string in code space is 0. 

IX.1.5 pi = .275, p 2 = .125, and /? 3 = .5. We want to know about the ball B x . B x includes only w x {A), hence one expects 
a probability of .275. 

IX.1.6 There is a set of random strings (of measure 0 in code space) for which this number will not occur, e.g. B 2 in the 
above problem with the string 1 1 has incidence 0. 



Answers to Chapter IX, section 2 

IX.2.2 X is the set of leaves, which is finite. T — 2 X the collection of all subsets of X. Q is the collection of sets consisting 
of one leaf. Any set in J- contains a finite number of leaves, therefore it is the finite union of sets in Q so by definition, Q 
generates T. 

IX.2.3 In order to prove that S is not in the field T we must show that it can not be written as a finite union of sets in T. 
Suppose that S = (J^=i ^« f° r some collection of A n e T where each A n is the union of finitely many subintervals. Then 
at least one of the A n must contain the disjoint union of an infinite number of intervals. But this contradicts the hypothesis 
about A n . Hence S <£T. 

IX.2.4 Let X = [0, 1] x [0, 1] C K 2 , and Q be the set of rectangles of the form [p/q, r/s] x[t/u, v/w] that is subrectangles 
with rational corners. Let T denote the field generated by Q. Let S e T. We need to show that S has rational area, i) (J^ A n 
has rational area, where the A„ are subrectangles since the area of the union is given by X^ =1 ar ea(;4„) — ar ea(;4, n 
Aj). ii) Similarly for any set of A n as above, the complement X \ (J^ =1 A„ = f)" =l X \ A„ has rational area, because X \ A„ 
can be written as the finite union of open subrectangles. Consequently any set S e T has rational area. The area of B(0, 1) 
is 7T which is irrational, consequently B(0, 1) is not in the field. 

IX.2.5 Call the sets in the figure 



a 


G\ 


G 2 


b 


c 


0 


e 


f 


8 


h 


i 


j 


I 


m 


X 


o 



Then we have the following list of equivalences, where those generated earlier in the list are used to generate the rest (once 
they have been established to be in the field). 
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a 


= X\G, 


X 


= Gi Ua 


0 


= X\X 


c 


= G,\G 2 


e 


= c\G, 


j 


= X\e 


i 


= G { \e 


f 


= cUi 


I 


= x\f 


m 


= iUl 


h 


— m\e 


o 


= X\h 


g 


= G,\o 


b 


= X\g 



IX.2.6 Let e be the image of the binary number a = .a x a 2 ■ ■ ■ = % under tne ma P 71 which takes a, < + 1 = e,, in other 
words nia) = e. The complement of e = 7r(a), a irrational, in £ is not the union of a finite number of cylinder sets, which 
under the inverse of this map map to intervals of rational width around rational points in [0, 1]. 

IX. 2.7 X is a space, let T = 2 X , the power set of X. Let A e T. Then since X \ A C X, we have X\AeT. Since 

X, 0CX,0€J. Compliments of subsets of X, and finite unions of subsets of X are subsets of X and are therefore elements 
of T. Hence T is a field (in fact the biggest field over X). 

IX.2.8 B is the a -field generated by rectangles with rational corners on the unit square. These rectangles form a countable 
base in the metric topology induced by the Manhattan metric, as evidenced by the fact that the a -field contains all balls of 
radius € for all e in the statement, and by the equivalence of metrics. To show this is the Borel field, the proof of Theorem 2.3 
shows that the a -field generated by a countable base for the topology of X is the Borel field. 
IX.2.9 £ is the codespace on the symbols {0, 1}. The cylinder sets are defined as all the sets, for all n, of the form 

C(ei ■ ■ ■ e„) = {e a € £ : e„ x = e u . . . , e a „ = e„} 

Let e be a point in E. Then the cylinder set C„(e) is precisely the set of elements x e £ such that x, = e, for / < n. 
Consequently, 

A( . \ei-Xj\ 

By choosing e such that e, = e, for i < n and e, ^ e, for i > n, we have that C n (e) = (e). The balls of radius 3"" form a 
countable base for the metric topology on E, hence, by the proof of theorem 3, they generate the Borel field B on E. 
IX.2.10 Let A C IR 2 be the Sierpinski triangle, and Q be the set of connected components of IR 2 \ A. Notice that since by 
definition of component, this generating set is disjoint. Consequently, one cannot get a subset of a component (other than 
the component itself or the empty set) by union or compliment of the elements in Q. If T is the <r -field generated by Q then 
T likewise contains no nonempty proper subsets of any component of IR 2 \ A. Let € > 0 be chosen so small that if x is the 
center of the largest triangle inside the Sierpinski set, B e (x) fits entirely inside this component. Then B e (x) is not in T, but 
it is in B. Hence T ^ B. T C B however, since any half plane is in B and any triangle is the union of the compliments of 3 
half-planes. 

IX.2.11 Let X be a metric space, Q a generating set for the field T\, and for the a-field T 2 . Let ^ be the a-field generated 
by ^i.Then 
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n=\ 

We must show that this is the smallest a -field containing Q. Suppose not. Then there is an A e T-$ such that A g T 2 . But T x 
is the smallest a -field containing Q so the extra entry must be of the form 

00 

(jA n ;{A„}C ^1- 

n=l 

But the smallest a-field containing Q must contain all of these sets, since it contains T\. Hence T 2 = F 3 . 

IX.2.12 Let T be any field of subsets of X. Choose A e T. Then X \ A e T and the union of these two is in T. The union 

is A U X \ A = X. 

Answers to Chapter IX, section 3 

IX.3.3 Let X — [0, 1] C R T is generated by subintervals, and for a < b we define /x((a, b)) = /x([a, b]) = b - a, and 
fx(A) — sum of lengths of subintervals of A. 

(i) ix-.T-* [0, oo). By definition \x(A) > 0. Since the sum of lengths of disjoint intervals is less than or equal to 
^([0, 1]) = 1 it follows that < cxd. 

(ii) Let [a, b] be an interval, and let b = lim^^an for some sequence {a n } such that a\ e [a, b]. Then the sum of the 
lengths of intervals in the union 

[a, ai] U [a x , a 2 ] U • • • = [a, b] 

is 

oo 

ix([a,a { ] + J^(a„+i - a n ) 

This is a telescoping sum and is therefore equal to 

/x([a, a x ] + /x([a u //ra^^a,,] = [a, b] 

Thus since any element in the field which can be written as the union of countably many elements must be unions of this 
form together with finitely many other elements, \x is countably additive. 

(iii) Let S be the set of rationals in [0,1]. Then S is not in the field T. To see this, we note that neither S nor its compliment 
can be written as the union of finitely many intervals, since both contain countably many components, thus any finite union 
contains an element with countably many components. 

(iv) Let T be the a-field generated by T. For any field or a-field, since 

AHB = X\(X\AUX\B) 

the finite intersection of elements of a field are also in the field. Thus for any x = p/q where p, q are integers, the set 
{x} = [a, x] n [x, b] e T and we have = 0. The rationals (and therefore S) are a countable set, consequently we can 

write S = \J™ =1 A n where {A n } are the elements of S. Consequently, S e T. We have 

oo oo 

£(S)=£((J A « = Z> (A " )= 0 

n=\ n=l *=Plq 

IX.3.4 Let E be the code space on symbols {0, 1}, and define 0<pi<l,p 2 = l- Pi-We define /x on cylinder sets by 

/x(C(ei • • -e„)) = p ex - Pen 

To define /x on other sets in the field T there is really only one way to be consistent with this definition, and satisfy countable 
additivity. We describe this: 

H(X) = n(C(l) U C(2)) = Pl + p 2 = l 
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/x(A UB) = fi(A) + /x(B) - /x(A n B) 

where A n B is in the field, since it is equal to X \ ((X \ A) U (X \ B)) by de Morgan's rules. The easiest way to see how 
this defines a countably additive measure on the field T is to note that for any finite set of codes {e x , . . . e n ] the measure 
defined on the cylinder sets of these codes taken in any order is the same. Notice also that two cylinder sets are disjoint if 
they disagree on any specified symbol. Taken together, the measure of a union of these sets is the product of the codes on 
which they agree, if they are the same length, and this product times the product of the 'tail' of the longer string if they are 
different lengths. There are no elements of the field T which can be formed which specify values for an infinite number of 
places, since this is the disjoint union of an infinite number of components. Consequently, S = {x : x is odd} is not in T, and 
fx({x : x-i = 1}) = p\. The set S e T since it can be written as a countable intersection (and therefore a countable union) of 
finite unions of cylinder sets. Its measure, as above, is the product of the probabilities on which the elements agree, namely 
all odd places, and is therefore lim n ^oo p" = 0. Notice that since the cylinder sets form a countable basis for the metric 
topology on £ , T is the Borel field on £ . 

IX.3.5 Since multiplication is commutative, it is easier to define the indices in reverse order. Thus we want the intervals 

w en o • • • o iy e ,([0, 1]) 

as a generator for the field T. Notice that the interval w x ([0, 1]) has measure p x and the interval w 2 ([0, 1]) has measure p 2 . 
Define ix(X) = 1. Then we must have 

fx(w x OW en O---0 W ei ([0, l])UU) 2 OU) e)) 0...0 W ei ) = p x p en ■ ■ ■ p ex + p 2 p e „ ■ • ■ Pe x 

in order to have even finite additivity, but 

PlPe„ ■■■ Pe { + PlPe n ' " ' Pe x = (P\ + 1 ~ P\)Pe„ • ' • Pe x = Pe„ * - " Pe„ 

so that, since these are the subintervals of w en o • ■ ■ o w e{ ([0, 1]) the 'middle third' has measure 0. Hence we have n(B) = 0 
for any B e T such that B C X \ A. This implies countable additivity, since every element of T must be the union of 
closed or open intervals beginning or ending on the boundary of a subinterval of the above form, and each subinterval 
is the countable union of subintervals, and satisfies countable additivity. Hence ix is a measure on T. This provides 
H(A) = 1, n(X \ A) = 0, n([l/3, 2/3]) = 0. 
IX.3.6 We define T on [0, 1] by using the IFS 

{[0, 1], IWi = ^x, w 2 = ^x + ^, p u p 2 } 

as in the previous problem. Here the subintervals touch, but the definition from problem 4 of ix(X) = 1 yields that the 
intersection of two intervals at their boundaries has measure zero. In this case the field is the set of finite unions of 
subintervals which have boundaries of the form k/2 n . By expressing the numbers on the interval in binary, since any element 
of the interval is the countable sum YlT=i( e " ~ l)/2" for e t e {1, 2} we generate all the subintervals of the interval, hence 
the extension of T is B([0, 1]). In order to generate the measure which takes the measure of an interval as the length of an 
interval, it is necessary and sufficient to have translation independence. This requires p\ = 1/2. 

IX.3.7 Let (X, d) be a compact metric space, \x a Borel measure on X such that ix{X) / 0. Let S be the support of fx on X. 
Then ju-(S) = ju-(X) / 0, so that S / 0. S is closed which implies that S is compact. Hence S e H(X). 

IX.3.8 Let ix be a measure on a -field T, and let T be the class of sets of the form AU B where A e T and B c C e T such 
that fx(C) = 0. T is a a -field since clearly X\B = X\ CL)C\B and C \ B is a subset of an element of T of measure 
zero. The same implies with unions of this form. Define Ji{A U B) = Ji{A) — fx(A). Then if we have a countable union of 
sets of this form, we have 



OO OO 00 

Ji(\J(A H U B n )) = 7I((|J A n ) U (|J B n )) 
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which is the union of an element of T with a subset of an element of T of measure zero. Hence 

OO 00 

7Z(|J(A„u5J)=7I(|Ja„) 
and this latter obeys countable additivity by assumption. Hence T is a cr-field, and Ji is a measure on it. 

Answers to Chapter IX, section 4 

IX.4.1 Let / : [0, 1] -> IR be a piecewise constant function with finitely many discontinuities, located at {x\, . . . x n C R}. 
Let jc 0 = 0, jc„+i = 1. Then / has a single value on each of the intervals 7 0 = [x Q , x x ], .../„ = and we let y, = 

fix), x e [*,-, *,-+,]. Then if x, = Xi*,- ,*,■+,] is the characteristic function on the interval /,-, we may rewrite 

n 

f(x) = J2 yiXl > 

i=0 

If fi is the Borel measure on [0, 1] given by /x([a, b]) = b- a, we have, by definition 

J[0,\] „=o «=o 

For the Riemann integral of / on [0, 1] we have 

p i r x \ r x n+\ 

/ /(jc)^jc = / f{x)dx + ■■■+ / f(x)dx 

r x \ r x n+\ 
= / y 0 dx H h / y n dx 

= yo(x\ - x 0 ) H — y„(x„ + i - x„) 
= f fdn 

J[0,1] 

IX.4.2 We want a measure on (^Euclidean), defined on the field Q = {A : A is a rectangle }, which corresponds to area. 
This means (assuming a<b,c< d),if A = [a, b] x [c, </] or the equivalent expression using open or half open intervals, 
we define = (b - a)(d - c), and for unions of rectangles define fi(A U B) = + fi(B) - n(A n 5). Let Aef 
such that A = |J~ , A„, A„ e ^ with the A„ disjoint. Then for the union to be in T we must be able to express it as a 
finite union of rectangles or the compliments of rectangles. Notice that the compliment of a finite union of rectangles is 
expressible on X as a finite union of rectangles. The measure of this region is then finitely additive by hypothesis, therefore 
fi is countably additive. Suppose there were another measure \i! on T which yielded the area of rectangles in X. Then since 
both measures need to be countably additive, and since they agree on each subrectangle of X, the two measures agree on T. 
Hence /*' = /*. The extension of fi to B is also a measure which yields the area of sets A e B, because B contains all open 
sets in X, the infimum represented in the Caratheodory extension theorem can be replaced by an infimum on open sets, and 
is therefore the area of the closure of A. 

To show that A is in B it suffices to note that it is closed. This follows since the compliment is then open, and can be 
covered by open sets in T. This cover has a finite subcover, which implies that X \ A is a countable union of sets in T, 
hence A e B. 

IX.4.3 The integral is defined on the intersection of two strips with A. To calculate it, we first determine the measure of 
an arbitrary strip [a, b] C [0, 1] intersected with A. The measure is countably additive, so note that given A C A, with the 
measure of A being fi(A), that we have 

H{A) = /x(iw,(A) U w 2 (A)) + fi(w 3 (A)) = (pi + p 2 )li{A) + Pin(A) = .5fi(A) + .5/x(A) 
Suppose that for n < N fi([k/2\ (k + l)/2«] x D& n A) = 1/2" for 0 < k < 2". That is, the measure of A falling within a 



514 Selected Answers 



strip of length 1/2" positioned on a boundary of [0, 1] subdivided into equal 1/2" intervals is its length. We note that each 
such strip intersected with A is a vertical stack of disjoint Sierpinski triangles. We now subdivide the strip into two parts. 
The left-hand parts contains a stack of twice as many triangles, half the size, while the right-hand is a stack of as many 
triangles, half the size. Each triangle in the original strip is produced by transforming A by some sequence (u;, o • • • w k ) 
of transformations, those on the stack in the left-hand portion are either of the form w, o ■ ■ • w k o w x or w, ■ o ■ ■ ■ o w k o w 2 
while those on the right have an included transformation by w 3 . Consequently the measure of the 1/2" width strip is divided 
equally on its two halves. By induction, and finite additivity, the measure of any strip beginning and ending on boundaries 
of the form k/2" is its length. By countable additivity, the measure concentrated on any strip [a, b] x R n A is its length. 

Having demonstrated this, the integral is 1 • 2/3 + (— 1) • 1/3 = 1/3. We will use the above result in further exercises. 
IX.4.4 Let a, ft e R, and /, g simple functions. Then we have 

a f '= 01 J] ytXii W = Yl^y^X'i (;c) 

fig = P Yl ZiXjj (x) = J^iPzdXjj (x) 

are simple functions. By definition, af + fig is af where g = 0, and fig where / = 0 and the sum elsewhere. We need to 
write the collection of sets |J /, > U (J Jj as the union of disjoint sets. This is done by writing 

Kk = h\[J Jj, K m +j = Jj \ [J //, K m+n+(i+j) = /,■ n Jj, K k = 0 elsewhere. 

Define w, = ay h wj = fizj, w k — ay, + fizj on the sets defined above. Then w(x) = ^ WkXfc k is a simple function, and 
equal to af + fig. The integral of the sum is the sum of the integrals by the above definition of w(x). 
IX.4.6 We need to rewrite the function, as in the above problem, as a simple function with disjoint sets. Writing C = 
A \ B, D = B \ A, E = AD B,we have 

f(x)=Xc + (2.3)XD + (l+2.3) X E 

To evaluate the integral, we need the measures on each of these sets. We define them in terms of strings: C is the set of 
strings beginning with 1, and not having a 2 in the second place, i.e., C = C(l, 1) the cylinder set beginning in two l's. D 
is the set of strings not beginning with 1 which have a 2 in the second place, or, D — C(2, 2). E is the set beginning with 1 
with a 2 in the second place, or, E = C(l, 2). By the definition of /x we have /x(C) = p x p\ — (A) 2 , /x(D) = p 2 p 2 = (.6) 2 , 
fx(E) = pip 2 = (.4) (.6). The integral is given by 

J fdfi = (A) 2 + (2.3)(.6) 2 + (1 + 2.3)(.4)(.6) = 1.78 

IX.4.7 Define the Borel measure 8 a by fi(B) = 1 if a e B and 0 otherwise. Let / be continuous / : (X, d) -> R. For each 
n, denote by {#;}*" j a Borel partition of X into disjoint sets where diam(#, < l/n. Define {/„} to be a sequence of simple 
functions defined on each partition by /„ = yiXB t with = /(*,) on the sets which do not include a and y ia — a on the 
set in each partition a e B ia . Then since the measure of all the sets is zero with the exception of B ia on which it is 1 , we have 

J fnd& a = f(a)Vn. 

Hence the limit as n -> 00 is f(a). Our partition is independent of the choice of jc, with the exception of the choice of y ia , 
but the function is continuous, so that as n -> 00 another choice of x ia approaches a as the diameter of B ia goes to zero. 
IX.4.8 We proceed by first deducing that since /(jc, y) is continuous, and the measure on ■ is defined on rectangles by their 
area, the value of the integral is the Riemann integral of /(jc, y). By definition, the Riemann integral is done by taking a 
sequence of rectangular partitions of ■, and constructing two sequences of simple functions, choosing y t , y[ by 

y t = inf {/(jc, y) : (jc, y) e B,}; y'. = sup{/(jc, y) : (jc, y) e B,} 

By Theorem 4.1, since /(jc, y) is continuous, the two sequences converge to the same limit, namely f^fd/x, since they 
converge and converge to the same limit, the Riemann integral is defined and is equal to this limit. The value of the integral, 
by taking the Riemann integral, is 1/2. 
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IX.4.10 Refering to exercise 3.4, the measure of any strip intersecting A is simply it's width. The function f(x, y) = x 
is independent of y so we partition the space into strips, their intersection with A produces decreasing diameters. f(x) is 
continuous, hence this partition and the choice of x t are immaterial. Choose x t to be the left-hand boundary of each strip, 
intersecting some point on A. Then the value of the integral is just /J x 2 dx - 1/3. Let X be the set of pixels on a computer 
display. For simplicity, define the distance between the pixel (/, j) on the i ,h row and in the j" 1 column and the pixel (k, i) 
as |/ _ fc\ + \ j - ii which is a Manhattan metric. Then X may be related to the subset of H 2 consisting of integer coordinate 
points in the first quadrant of the same size. This set is closed and totally bounded. Hence X is compact. Since there are 
finitely many pixels, we may define the generator for the Borel subsets to be all the subsets consisting of one pixel each. 
This defines the Borel field on X to be the power set of X. Thus any set of pixels is a Borel subset. Suppose there are J rows 
and K columns so that the total number of pixels is N = J x K. Define a Borel measure by fi(A) = (number of pixels in 
A)/N. Any function from X -> M. is continuous because the pixels themselves are open sets, hence the inverse image of an 
open set in R is always open. 

We will choose the function /(/, j) - i + j. Any function on the finite set of pixels can be described by its value on 
each pixel, hence its value at a pixel times the characteristic function at that pixel, hence the sum of such terms makes any 
function simple. Then the integral of / over X is 



where the summation notation indicates summing over all pixels in X. Since there are K pixels with first value /, and J 
pixels with second value j , we have 





1 N(J + \) + N(K + 1) 
N 2 



so the integral is 
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IX.5.1 Let 



X = 



€{1,2, 3,...,K}} 



and define a metric d : X — > K. by 

d((i u ji), (12. h)) = \h ~ h\ + \ j\ ~ jz\ 
Then (X, d) is a compact metric space. Define /x to be an atomic measure with 




and let v be another atomic measure with 



v((/,7)) = -j ?5 



for every (/, j) e X. 
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We wish to calculate the distance in the Hutchinson metric between fx and v, given by 

d H (fi, v) = sup{ f fdfi- f fdv : / e C(X, R), - f(y)\ < y)Vx, y € X} 
Jx Jx 

where C(X, R) is the set of continuous functions from X to R. Now X is a discrete space under a metric which is the 
restriction of the Manhattan metric on R 2 , consequently / : X R is always continuous. As well, the integrals are the 
integrals of simple functions. Hence for every /, we the integrals can be expressed 



so the difference is, for any function /, 



5 f{i ' j) (a^kk 1 - -h) = h ? /0 '' j) (^tt - 0 

Intuitively, one is tempted to pick the function f(i,j) = i+ j, which among the set of valid functions is the largest of the 
form 

f{i,j)=ai+fij 



However, the range of the term (i + j)/(K + 1) - 1 is from 2/(K + 1) - 1 = -(K - \)/{K + 1) to 2K/(K + 1) - 1 = 
(K - \)/{K + 1) which gives a negative coefficient to f(i, j) over almost half of X detracting from the value of the sum. 
We must therefore endeavor to pick /(/, j) as large as we can in absolute value, negative when the measure subtraction term 
is negative, and positive when it is positive. That is, we maximize by choosing the largest value for |/| allowable under the 
distance constraint where |(i + j)/(K + 1) - 1| = \C(i, j)\ is largest. The interested reader can confirm that this exceeds 
the intuitive result. The value of C(i, j) is constant along each diagonal from (1, n) to (n, 1) and is equal to zero along 
the major diagonal from (1, K) to (K, 1). It is anti-symmetric with respect to this diagonal, negative above it and positive 
below. 

We thus place the largest values for |/| at (1, 1) and (K, K), and since the distance between these is 2K - 2, we set them 
to ±(K — 1). We then proceed outward from these points in Manhattan metric circles. The distance between such circles 
(which are the diagonals where they intersect X) is always the horizontal distance between the extensions of the diagonals, 
thus we end up satisfying the distance constraint by putting / to 



■(K-\) -{K-2) 
-(K-2) -(K-3) 



-1 
0 



-1 
0 



(K-3) (K-2) 
(K-2) (K-l) 



This may be summed by taking twice the sum above the diagonal, and noticing that there are n entries where (K — n) is the 
absolute value of the entry, yielding 
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2 K 

— J^n(K -n) 2 

K 2 (K + 1) 

2 K 

= — T^(nK 2 - 2n 2 K + n 3 ) 

K 2 (K + 1) 

~ K 2 (K + l) ^ H ~ K 2 (K + 1) + K 2 (K + 1) ^ 

_2K 2 K(K + l) 4KK(K + \)(2K + 1) 2/f 2 (/f + l) 2 

~ 2K 2 (K + 1) tf 2 (tf + 1) + 4tf 2 (tf- + 1) 

K - 1 
= — - — =fl//(M, v) 
o 

IX.5.2 Since yLi e "P(X) the measure of the entire space is £i(X) = 1. This value is identical with the value on the support 
of [i which we will call S, that is, this implies /x(S) = 1. In particular it is non-zero, and since, by Theorem 3.1 in Chapter 
IX statement (3) we have /x(0) = 0, S ^ 0. Then by Theorem 3.4 in Chapter IX, the support of a Borel measure on a metric 
space is closed. Since X is compact, a closed subset of X is also compact, hence S is compact. Since S is compact and 
non-empty, S, the support of n is an element of H(X). 



Answers to Chapter IX, section 6 

IX.6.1 We want to show that the Markov operator is indeed a transformation on V(X), that is, M(v) : V(X) -> V(X). It 
suffices to show that 

vo W r l :V(X)^V(X) 

since a sum of these operators is a Borel measure if they are, and p, — 1 indicates that they are normalized if each 
entry is. 

(i) k;," 1 : X X, since for hyperbolic IFS these are invertible maps. In addition, if we define w~ l ({x}) — 0 whenever 
x & Wj(X) then w~ l (X) = X. Hence y o wJ~ l (X) = 1, and this is a normalized Borel measure. Hence M(y)(X) = 1. 

(ii) v o u;," 1 is a map X [0, oo) (actually X [0, 1]). Furthermore, by definition, if {Aj}f =] are mutually disjoint, 

00 oo 

j= i j= i 

oo oc 

= J2 v(w7\Ai)) = J](v o wJ l )(Aj) 
;=i j=i 

so that y o u;," 1 is countably additive. Since u;," 1 maps open sets to open sets (because Wj is a contraction, it is continu- 
ous),and since w~[\X \ A) = wJ x (X) \ wf l (A), w~r x takes the generators for the Borel subsets into themselves, and pre- 
serves the operation of set complementation. Hence w^ 1 : B B, and the expression v o w~ x is well defined. Hence we 
have y o W r l : V(X) -* V(X) and therefore M(y) : V(X) -* V{X), as desired. 
IX.6.2 We define L by 

L = {/ e C°(X, R) : |/(x) - /(y)| < d(x, y)Vx, y e X} 

and let / e L. Define if = s -1 o iu,-, and we wish to show that / e L. Since / is a scalar times the sum of 

compositions of continuous functions, / is clearly in C°(X, IR). We must therefore only verify the distance condition. For 
all x , y e X we have 
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\~f(x) - f(y)\ = \s~* \f o Wi (x) - f o Wi (y)\ 

i 

< s~ l ^2 Pi\Wi(x) - Wi(y)\ since / e L 

i 

< s~ x Pi s \ x ~ y \ since s = max s, 

i 

<s~ l -s\x -y\ ■ ^2 Pi 

i 

<\x — y\= d(x, y) since ^ pi — 1 

Hence / e L. 

IX.6.7 Let X consist of three points: X = [A, B, C}, and let an IFS be defined by 

{X; w u w 2 ; pi = .6, p 2 = .4} 

where 

Wi(A) = B w x (B) = B Wi(C) = B 
w 2 (A) = C w 2 (B) = A w 2 (C) = C 

Furthermore, let ixq(A) = fi 0 (B) = iia(C) — 1 /3. We now let fi„ = M n (/i 0 ) and find constant entries for the Markov transi- 
tion matrix M,a,b, . . . , i, such that 

/i„(A)\ /a b c\ //i„_i(A) 
H n (B) = [d e f )[nn-dB) 
H n {C)J \g h i / \n„-dC) 



By definition, 



so that 



and we have 



giving us 



H„ = .6(/i n _! ow), 1 )! .4(ai„_! o w 2 l ) 

/i„(A) = .6(/i„_i o w;\A)) + Aifin-i o w?(A)) = 0 + .4/i„_i(fl) 
fi n (B) = -6(/i„_i(A) + /i„_i(fl) + /i„_i(C)) + 0 
/i„(C) = 0 + .4(/i„_i(A) + /i„_i(C)) 

/0 .4 0 
M = I .6 .6 .6 
\ 0 .4 .4 

For a discrete space like this, M completely determines M : V(X) 7>(X), by defining at each point in the space. In 
the limit, M" M, an operator which takes every point in V(X) to some measure fi /, and which is therefore necessarily 
singular. By definition fif = M(/i/). This is an eigenvalue equation which means that fi f is an eigenvector of the eigen- 
value 1. 
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IX.6.8 We first look at the sequence of measures {fM n }^Lo- We nave 

Mi(^) = Pi^o(w~\A)) 

i 

Vi(A-) = ^2piiAi(w~ l (A)) 

i 

= E^(E^^7 l (u;r 1 (A))) 

' i 

= Pi(%2 Pj^oiwj 1 o wr\A))) 

i j 

Since the sum after n such iterations is finite, the order is not important and we can write 

Hn(A) = ]P PiPj ■ ■ ■ p k iAo(Wk l o-- - owj 1 o u>f '(A)) 
i,j,....k 

where the sum indicates a sum over all the indices i, j, . . . ,k. 

Ho is concentrated on A since, by assumption, /x 0 € V(X), and /x 0 (A) = 1 so the support of /x 0 is a subset of A. If x & A, 
all the terms in the above sum are zero because of this, hence the support of /x n is again a subset of A, and we must have 
lx n (A) = 1, because M : V(X) P(X). Now let /u, be the unique invariant measure of the operator M, that is M(/x) = /a. 
From the formula just derived with respect to ix 0 for M"(ix 0 ), and from the invariance of /x we must have, for any set B, 

fi(B) = ^ ptpj ■ ■ ■ pkl^iw^ 1 o ■■■ o wj l o w^ l (B)) 

i,j,...,k 

Now choose B = w, ■ o Wj ■ o ■ ■ ■ o tu*(A). Then we have 

£1(U>; O Wj O • • • O W k (A)) = ^ /? m /?„ • • • Pl(w~ l O UT 1 O • • • U)J~ l (Wj O Wj o ■ ■ ■ o W k (A)) 

m,n,...,l 

where the sum is over the same number of indices as in the composition for B. Since each element of the sum is positive or 
zero, the sum is greater than or equal to each of its entries, and we have 

lx{Wi o Wj o ■ ■ ■ o u^(A)) 

> PiPj ■ ■ ■ Pk(W^ X O • • • O W~ l O W^~ l (Wi O Wj O • • • O Wk(A)) 

> PiPj ■ ■ ■ PkV(A) = PiPj ■ ■ p k 

Equality holds precisely when this is the only contributing term of the sum, that is 

H(W~ 1 O W~ X O • • • O W^ l (Wi O Wj o • • • o w k (A)) =0 

whenever the ordered set of indices (m, n, ...,/) ^ (i, j, . . . , k), which says for all of these ordered sets the set 

W~ l O W~ X O • • • O Wf l (Wi O Wj O • • • O Wlc(A)) 

is at least of measure zero, if not empty. It is empty for the totally disconnected case, hence equality holds when the IFS is 
totally disconnected. 

IX.6.11 The measure /u, has support on A, consequently it suffices to restrict to Borel subsets of A. We first let the IFS be 
totally disconnected. Then the sets of the form 

Wi == W a „ O W an l O ■■■ O W ax {A) 

for all n form a countable basis on the space A. Hence since for the totally disconnected case, these are all both closed and 
open, for any Borel subset B of A we have 
•* 

00 

B - |J Wt 
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for some disjoint choice of W,-. But 

00 00 00 

i=\ i=l (=1 

where C, are disjoint cylinder sets in £ . The measure p on £ by observation gives the measure on any cylinder set as 

p(C(a u or*)) = p x ---p k 

which is precisely the measure \x applied to the corresponding basis set W, on A. Consequently, the measures agree, and we 
have 

When the attractor is not totally disconnected, we have for each W, finitely many cylinder sets in the image 0 _1 (W;), which 
are disjoint, and correspond to terms in the sum given in exercise IX.6.8. Consequently the measures agree on Borel subsets 
of A in this case as well. Since the Borel subsets of A are precisely the subsets A D B where B is a Borel subset of X, and 
since the support of the measure is A, the equality holds for any Borel subset of X. 



Answers to Chapter IX, section 7 

IX.7.1 We want to prove that x + 8]) — 8, where is the invariant measure of the IFS 

{[0, 1]; wi = l/2x, w 2 = l/2x + 1/2; 1/2, 1/2} 

To do this it suffices to show that this expression holds on any closed interval of the form [k/2", (k + l)/2"] C [0, 1], since 
we may then build an approximation of any interval from these, good to any error, by the disjoint union of subsets of this 
form which cover the interval. Since \x is defined as the infimum of the measures of such covers, we let the error go to zero. 

To show that ii([k/2", (k + l)/2"]) = 1/2" we note that for each iteration of W(A) the image is 2 times the number of 
subintervals of A of half the length of those of the iteration before. Each of these subintervals is of the form 

w ai o • • • o w ak (A) 

where there are k terms in the composition for W k (A). Consequently, these intervals of length 1 /2 k have measure 

(l(Wol O • • • O W ak (A) = p ax ■ ■ ■ p ak 

Since each term in the product is 1 /2, the right hand side is 1 /2 k as desired. 

IX.7.2 We must prove the /u. applied to any rectangle in ■ gives the area of the rectangle, where is the invariant measure 
of the IFS 



{■; w u w 2 , u> 3 , m\ 1/4, 1/4, 1/4, 1/4} 



and the wi are given by 



U>2 



U>3 



1X>4 



(;)=(; ?)(;) 
(;)-(! !)(;)♦(» 
(;)■(! °)(;Mi) 

CMS DGM!) 
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that the invariant measure /x is uniform, that is, that for any rectangle, we have /x(([a, b], [c, d])) = (c - a)(d - b). We 
argue as in the previous problem: For any square with a side of length of the form 

k k+l I 1+ 1 

5_([ 2^' ~2^ L[ 2"' 2" J) 

we have some sequence i, k of digits, of length n such that this square is equal to 

S — Wi o Wj o • • • o tu,t(i4) 

and hence its measure is 

1 2 

lx(S) = piPj-- p k = { — ) 

so that the measure has the uniform property on such squares. These squares form a countable base for the Borel subsets of 
A, consequently any rectangle may be written as 

oo 

([a,b],[cd]) = \JSi 

(=i 

where the 5, have been chosen such that /x(5, n Sj) = 0, that is, they meet only on the edges. Consequently, by countable 
additivity, we have /x(([a, b], [c, d])) = (c - a)(d - b), for this rectangle, consequently the measure of any rectangle in A 
is its area. 

IX.7.4 The distance between two points in ([0, \] p , d p ) can be written 

d p (a,b) = \d p (a,0)-d p (b,0)\ 

We start with the sets in each space corresponding to addresses in a cylinder set in code space, namely 

0(C(cr 1 , ct 2 , cr«)) and <t> P (C{o x ,a 2 , . . . , a n )). 

If the measures agree on the size of these sets, which form a base for the Borel field in each space then since we can rewrite 
/ on these as 

f(C(a x ,a 2 , ...,a n )) = (p^' 1 (C(a x , a 2 , . . . o n ))) 

we will have demonstrated that the measures are equal. On [0,1], almost by definition, the invariant measure v applied to a 
cylinder set yields 

v(C(a u a 2 , o n )) = pip 2 ■■■ p n , 

that is, the probability of landing in the cylinder set on n iterations. 
On [0, \] p , the cylinder set corresponds to an interval with endpoints 

■0\a 2 ■ ■ ■ er„0 and .a x a 2 . . . a n (N — 1) = .o x o 2 . . . (a n + 1)0 

How wide is this interval? Start with n = l. The interval (.a u .(ai + 1)) can be read from the IFS we used to construct 
[0, \] p , to be of width p ay A contraction of the same relative subdivision of [0, 1] is placed inside it to make our "tick 
marks" by the IFS, so that the portion of this subinterval whicn is also in the interval (.a x a 2 , .a x (a 2 + 1)) is thus p ax Pa 2 - 
If this proceeds in this fashion up to a k , then the subinterval which is again divided by the action of the IFS is of length 
Pa x Pa 2 • ■ • /> a(t , and we take a fraction of this equal to p ak+x times its length. Consequently by induction, the length of the 
image of a cylinder set is 

lx(<p p (C(o x ,a 2 , cr„))) = d(.a x a 2 ■ ■ ■ a n , .o x a 2 ■ ■ ■ (a n + 1)) = p ax Pa 2 ■ ■ ■ Pa n 

which agrees with the value on [0, 1] for the image of the same set under the invariant measure v. Hence v(A) = fi(f(A)), 
and / is measure preserving. 

IX.7.5 The diagonal of the square forms right triangles with the vertical lines generated by the tick marks on the x-axis, and 
with the horizontal grid lines generated by tick marks on the y-axis. The slope of this line is 1, hence the distance between 
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horizontal grid lines on the diagonal is V2 times the distance on the x-axis. Consequently, a point picked at random on the 
diagonal has a p t chance of falling between the i — 1 th grid mark and the i th grid mark, and so forth. With probability 1 then, 
a point on the diagonal will have each digit occuring in proportion to the p t spacing on the jc-axis in its x coordinate, and 
the digits occur with equal frequency in its y coordinate. 

Answers to Chapter IX, section 8 

There are no mathematical exercises in section 8. 



Index 



V , definition, 10 

■7\l , definition, 9 

v ' , definition, 9 
0 . definition, 9 
A , definition, 9 
■ , definition, 9 
|A|, definition, 58 
A C X, definition, 9 



address, 2, 115, 122, 125, 141, 149, 274, 300, 
359 

definition, 125 

illustration, 116, 117, 118, 119, 120, 121, 124 

periodic, see periodic address 
affine transformation, 2, 43, 45, 49, 51-55, 53, 
70-74, 77, 95, 97, 98, 99, 100, 113, 118, 
119, 130, 136, 142, 145, 211, 219, 225, 
227, 233, 236, 239, 243, 253, 272, 332, 
361, 376 

definition, 49 

illustration, 45, 48, 51, 52, 56, 57, 72, 98, 100, 

101, 102, 103, 104, 106, 107, 233, Plate 9-11 

see also transformation, affine 
algorithm, see fractals; specific algorithms 

for computing 
analytic transformation, 61-68, 74, 289 

definition, 66 
Andes Girl, 377 

illustration, Plate 27 
Arctic Wolf, illustration, Plate 28 
attractive cycle, 147, 268, 271, 314 

definition, 132 
attractive fixed point, 133, 136, 147, 247 

definition, 132 

illustration, 133-148 
attractive periodic point 

definition, 133 
attractor, 81, 83-84, 85, 87, 88, 90, 91, 92, 93, 
94, 95, 96, 97, 99, l6o, 101, 103, 111, 113, 
115, 116, 117, 118-119, 122, 123, 125-131, 
133, 134, 136, 141, 145, 146, 149, 152, 153, 



154, 155, 156, 158, 159, 169, 170, 177, 182, 
183, 184, 185, 199, 200, 202, 211, 212, 214, 
215, 218, 223, 226, 232, 234, 243, 246, 252, 
253, 260, 266, 271, 272, 280, 282, 298, 299, 
301, 302, 305, 307, 308, 309, 313, 318, 319, 
327, 331, 332, 333, 343, 345, 347, 359, 363, 
367, 376, 382 
definition, 81 

illustration, 82, 92, 93, 101, 102, 103, 104, 

105,116,121,187, 201,231,311 
strange, definition, 81 
A \ B, definition, 10 



basin of attraction, 278, 281 

definition, 258 

illustration, 280, 281 
biological modelling, 3, 282 
Black Forest, 377-378 

illustration, Plates 31-34 
Bolzano-Weierstrass Theorem, 22 

illustration, 22 

proof, 20 

Borel field, 340, 341, 342, 343, 345, 347 

definition, 340 
Borel measure, 4, 341, 343, 344, 347, 348, 349, 
365, 372 

illustration, Plate 23 
Borel measure, normalized, 349, 350, 404 

definition, 349 
Borel subset, 335, 340, 341, 345, 363, 365, 366 

definition, 335, 340 
boundary, 24, 54, 259, 263, 266, 267, 278, 296, 
298, 301, 305, 310, 314, 317, 318, 324, 326 

definition, 23 

illustration, 18, 24, 25, 104, 120, 259 
bounded, 21, 171, 195, 198, 201, 310 

definition, 20 

totally, 20 
boundedness, 15 

Box Counting Theorem, 175-176, 188, 
190-191, 224 
illustration, 175, 178, 179 
proof, 175 



C, definition, 43, 44 
C, definition, 7 
C, definition, 7 

Cantor set, 140, 155, 165, 180, 259, 279 

illustration, 44 
Carath6odory's Extension Theorem, 4, 342 

proof, 342 

see also Extension Theorem 
Cartesian map, 24 
Cartesian product, 26, 123, 229 

definition, 9 
Castle fractal, IFS code for, 186 

illustration, 185 
Cauchy sequence, 16, 17, 19, 20, 35, 36, 38, 42, 
45, 77, 78, 92, 124, 171, 210, 257, 288, 348, 

definition, 15 

illustration, 18, 29, 35, 39 
Cauchy subsequence, 

extension of, 34, 35 

illustration, 35 
cell, 324 

Chaos Algorithm, illustration, 90 
Chaos Game, 2, 3 

for a recurrent iterated function system, 385-388 
chaotic, definition, 167 
chaotic dynamical system, 2, 164, 167-169, 
193, 275, 278, 285 

definition, 167 
characteristic function, definition, 344 
X, definition, 344 

classical Cantor set, see Cantor set, classical 
closed, 20, 21, 23, 24, 172, 288 

definition, 19 
closedness, 2, 15 
closure, 129 

definition, 19 
Cloud Study (video) 113, 377 
illustration, Plate 25 
code space, 78, 84, 119, 120, 121-123, 125, 
126, 127, 128, 129, 130, 140, 141, 146, 149, 
152, 160, 162, 164, 167, 170, 273, 274, 299, 
308, 339, 341, 343, 352, 359, 363, 365 
definition, 18, 120 
Collage, 376-378 

Collage Theorem, 2, 94, 95, 97, 100, 101, 186, 
221, 275, 378, 380 
illustration, 96, 97, 98, 102, 103, 107, 108 
proof, 103 

for a recurrent iterated function system, 392-403 
Collage Theorem for Measures, 360, 361 
illustration, 361 
proof, 360-361 
compact, 20, 22, 23, 24, 41, 79, 92, 123, 124, 
125, 180, 184, 215, 243, 256, 258, 272, 287, 
288, 340, 342 
definition, 20 



compact metric, 146 

compact metric space, see metric space, com- 
pact 

compactness, 2, 15, 23, 29, 159, 172, 299 
completeness, 2, 15 
definition, 16 

see also metric space, complete 
Completeness of the Space of Fractals Theo- 
rem, 35-37 

proof, 36-37 
completion, of a measure, 344 

definition, 344 
computer programs 

Deterministic Algorithm, 85, 87, 210 

Escape Time Algorithm, 249-250, 253, 254, 
277 

see also Escape Time Algorithm, illustration 
hidden variable fractal interpolation function 

with Random Iteration Algorithm, 234-236 
invariant measure of an IFS with probabilities, 
370-372 
illustration, 373 
parameter space coloring, 311-312, 324 
Random Iteration Algorithm, 87-90, 216 
Random Iteration Algorithm with interpola- 
tion and vertical scaling factors, 213-215 
illustration, 215 
web diagram, 135-136 
condensation, 218, 276 
condensation set, 91-94 

definition, 91 
condensation transformation, definition, 91 
conformal, 7-8 

connected, 26, 27, 83, 257, 299, 302, 303, 313, 
314, 318, 326 

definition, 24 

illustration, 28 
connected, multiply, 26, 27 

definition, 26 

illustration, 28 
connected, pathwise, 26, 27, 238, 243, 256, 261 

definition, 24 

illustration, 109 
connected, simply, 26, 27 

definition, 26 
connectedness, 

see also connected; connection 
connection, illustration, Plate 16 
continuous, 40-41, 62, 79, 103, 104, 106, 110, 
111, 124, 125, 127, 128, 147, 217, 221, 257, 
268, 287-289, 347, 350, 363 
continuous, uniformly, 112-113 

definition, 40-41 
continuous deformation, definition, 26 
continuous function, 6, 229, 233, 237, 238, 242, 
245, 349, 356, 357, 370, 371 



definition, 14 

illustration, 8 
continuous transformation, 122, 288, 299 

illustration, 150 
Contraction Mapping Theorem, 75, 217 

proof, 73 

contraction mapping, 2, 4, 74-75, 77, 78, 79, 
80, 81, 83, 84, 91, 92, 95, 102, 104, 131, 217, 
218, 357 
definition, 74 
illustration, 76, 218 
contractive transformation, 76, 80, 236 

illustration, 76 
contractivity, 75, 79, 80, 81, 83, 84, 91, 93, 95, 

96,97,102,111,158,163,218 
contractivity factor, 232, 298, 300, 304 

definition, 74 
convergence, 2, 15, 16, 20, 124, 171, 268, 
277, 288 
definition 15, 16 
illustration, 39, 46 
convergent sequences, 34, 40, 83, 88, 276, 
353, 354, 356 
illustration, 82, 83 243, 244 
coordinate changes, 68-74, 77, 99-109 

illustration, 70, 71 
coordinate space, 68 

illustration, 70 
coordinate system, 68, 69, 71, 72, 176, 190, 
238, 294 
illustration, 69 
countable, 22, 84, 122, 125, 318 
definition, 120 
illustration, 124 
countable base, 340 

definition, 340 
critical point, 62, 64, 66, 67, 313 
definition, 47, 66 
illustration, 48 
critical value, 67 
definition, 48, 66 
illustration, 48 
cycle, 170-171, 274, 276 
definition, 132 

see also attractive cycle; repulsive cycle 
cylinder subset, 339 
definition, 339 



dS, definition, 24 

data, 192, 193-196, 205, 208, 211, 213, 215, 
218, 219, 222, 223-224, 233 
definition, 208 
generalized, 229, 232 
definition, 229 
illustration, 195 



deformation, bounded, 13-14 
deformation, of space, 47, 52-53, 179 
degree, of transformation, definition, 47 
b u (B), definition, 348 
delta function, definition, 348-349 
dense, 167, 168, 170, 274 

definition, 165 
Desktop Fractal Design System, 399 
Deterministic Algorithm, 2, 86, 88, 219 
computer program for, 88-90, 210 
deterministic fractals, 379 
deterministic shift dynamical system, illustra- 
tion, 166 
D H (A), definition, 201 
d H v), definition, 349 
dmm(A), definition, 199 
diameter, of partition, definition, 347 
diffeomorphic, 148 
differentiable, 148, 318 

definition, 148 
disconnected, 27, 192, 319 

definition, 24 
disconnected, pathwise, 26 

definition, 24 
disconnected, totally, 29, 83, 115, 117, 127, 
140, 146, 148, 149, 154, 158, 164, 165, 169, 
170, 183, 186, 199, 206, 253, 299, 302, 303, 
319, 320, 326, 359 
definition, 24, 125 
illustration, 155 
disconnection, illustration, Plate 16 
dynamical system, 2, 81, 130-140, 141-142, 
149, 151-152, 246, 250, 252, 253, 255, 266, 
268, 271, 272, 276, 297, 309, 312, 313, 314, 
317, 318, 324, 326, 327 
definition, 130 

sensitive to initial conditions, definition, 167 
stretch, squeeze, and bend, illustration, 131, 
132 

transitive, definition, 167 

see also chaotic dynamical system; set dy- 
namical system; shift dynamical system; 
slide and fold dynamical system; stretch, 
squeeze, and bend dynamical system; 
transitive dynamical system 

dynamical systems, family of, illustration, 
Plates 17-20 

dynamics, on fractals, 2 



€= , definition, 5 

Elton's ergodic theorem, 4, 364-370, 376 
6-net, 20 

definition, 20 
equivalence, 2, 3, 146, 147, 148, 164, 165, 
206, 287 



see also metric space, equivalence; metrically 
equivalent 

Escape Time Algorithm, 3, 246-266, 271, 272, 
277, 282, 288, 289, 292, 293, 319, 324 
computer program, 249-250, 253, 254, 255, 
278 

illustration, 251, 253, 254, 259, 264, 263, 
264-265, 267, 268, 287, 290, 291, 320-324, 
325, 328, 329, Plates 6-14 
Euclidean geometry, see geometry, classical 
eventually periodic point, definition, 133 
see also periodic, eventually 
Extension Lemma, 34—35 
extension of a Cauchy subsequence, 34, 35 
Extension Theorem, Caratheodory's, 4 



7, definition, 337-338 
definition, 42, 43 
(A) (for a set A), definition, 65 
family resemblances, 326 
Feigenbaum ratio, 314 
fern, 92, 170, 282 

IFS code for, 85, 90, 91 

illustration, 32, 39, 91, 166, 346, Plate 3 
fern, Black Spleenwort, 97, 126 

IFS code for, 99 

illustration, 102 
field, 338, 339, 342, 343 

definition, 337-338 

illustration, 339 

see also Borel field; sigma-field 
field, generated by <3, definition, 338 
fixed point, 72, 73, 75, 76, 77, 78, 79, 81, 97, 
102, 103, 127, 152, 218, 268, 272, 280, 
288, 352 
definition, 71 
table, 103 

see also attractive fixed point; repulsive 
fixed point 
folds, 46-47 

illustration, 50 
/ on , definition, 43 

fractal dimension, 3, 15, 146, 171, 206, 207, 

212, 223, 224, 224-226, 233, 247, 252, 253, 

266, 278, 302, 303, 308 
calculating, 173-175 
definition, 171, 173, 180-181 
experimental determination of, 188-195, 227 

illustration, 188, 190, 191, 192, 193, 194, 
196, 197 

table, 189, 191 
idea behind, 172-173 

illustration, 172, 176, 177, 178, 179, 183, 185, 
186, 187, 201, 202, 203, 204, 231 



of fractal interpolation functions, 223, 226 
fractal function, hidden variable, 240-243 
fractal interpolation function, 3—4, 207, 218, 
219, 222, 223, 226, 227 
definition, 218 
generalized, 233, 237 
definition, 233 
illustration, 239 
hidden variable, 234, 236, 237-238 

computer program for, 234—236 
illustration, 208, 209, 212, 220, 221, 222, 223, 
224, 225, 228-229, 230, 231 
fractal interpolation, 3 

hidden variable, illustration, 237 
fractal systems, 379, 383 
fractal tree, 2, 93 

fractal, deterministic, definition, 79 
fractals, algorithms for computing, 2, 84—91 
see also "Chaos Game"; Deterministic 
Algorithm; Random Iteration Algorithm 
function 
characteristic, definition, 344 
simple 
definition, 344 
see also simple function 
see also specific functions 



generalized data, definition, 229 

see also data, generalized 
geodesies, definition, 10 
geometric series, 46 

illustration, 46 
geometry, classical, 1, 205, 206, 207 

illustration, 114, 206 
geometry, deterministic, 4, 42, 236, 373 
graph, 233, 234, 237, 240, 243, 252, 272, 345 

definition, 229 
group, 73, 74 

definition, 73 



H(X), definition, 27 

illustration, 30 
Hausdorff distance, 45, 79, 95, 207, 289, 297 

definition, 32 

illustration, 32 
Hausdorff metric, 33, 79, 80, 85, 95, 111, 207 
219, 268, 289, 326, 359 

illustration, 112 
Hausdorff /7-dimensional measure, 201 
definition, 200 

illustration, 201 
Hausdorff-Besicovitch, 3 
Hausdorff-Besicovitch dimension, 200, 202, 



203, 204, 205, 295 

definition, 202 

illustration, 205 
hidden variable, 234-235 

definition, 234-235 

see also fractal interpolation function, 
hidden variable 
homeomorphic, 14, 145-146, 299 

definition, 145-146 
homeomorphism, 15, 41, 145, 146, 148, 152, 
165, 299 

definition, 14 

illustration, 29, 147 
Hutchinson metric, 349, 351, 406 

definition, 349 



IFS, 2, 81, 83, 84, 85, 87, 88, 90, 91, 93, 

94-101, 136, 140-145, 146-148, 149, 165, 
169, 182-183, 203, 207, 210, 211-213, 214, 
215-216, 218, 223, 226, 232, 234, 236, 
240-243, 245, 246, 253, 266-267, 270, 278, 
280, 282, 304-306, 352 
definition, 81 
hyperbolic, 149-152, 159-161, 168, 170, 183, 
185, 186, 199, 202, 209, 210-211, 215-216, 
218, 219, 232, 243, 252, 254, 272, 274, 298, 
299, 301, 304, 313-314, 318, 350, 352, 
360-363 

hyperbolic with condensation, 92-94, 119 

definition, 91 
illustration, 96, 98, 100, 112, 114, 187, 231 
lifted, 154, 168 
definition, 154 
Random Iteration Algorithm applied to 

computer program, 213-214, 234-236, 238 
with condensation, illustration, 92, 93, 94 
with probabilities, 331, 332, 335, 343, 345, 
351,354, 359, 360, 364 
illustration, 362, 363, 364 
invariant measure of, 352 
computer program, 370-372 
illustration, 373 
see also iterated function system 
IFS code, 219, 376, 377, 378 
for a measure on B, 331, 332, 334 
illustration, 332, 333, 334 
for Black Spleenwort, 99 
for Castle fractal, 186 
for Fern, 86, 90, 102 
for Sierpinski triangle, 85, 89-90 
for ■, 238, 240, 24$ 
for Square, 86 
for Tree, Fractal, 86 
image animation, 111 



InfS, definition, 33, 74-75 
infinum, definition, 33, 75 
Jx f d\i., definition, 348 
integral, 348 

definition, 345, 348 

of a continuous real-valued function, 4 
interior, 25 

definition, 23 

illustration, 24, 25 
interpolation function, 209, 210, 229, 233 

definition, 208, 229 

see also piecewise linear interpolation 
function 

interpolation points, 218, 223, 229 

definition, 209, 229 
interval [a, b], definition, 9 
invariant, 15, 165, 280, 288 

illustration, 16 
invariant measure, 360-363, 365, 366, 372, 376 

definition, 334-335 

illustration, 374, 375 

of IFS with probabilities, 359, 360, 361, 366 
definition, 352 
illustration, 363, 364 
invertible, 14, 49, 58, 68, 149, 150, 153-154, 
158, 184, 217, 299 
definition, 42-43 
invertible transformation, 65, 73, 159-160 
iterated function system, 2, 330 
definition, 80 

with probabilities, 330-331 
definition, 330 

see also IFS, with probabilities 
see also IFS 
iterates, 47 
backward, definition, 43 
forward, definition, 43 



jet exhaust, 3, 205-206, 207 

see also jet flame 
jet flame, 191-193 

illustration, 193, 194 
Julia set, 3, 61, 256, 258, 259-263, 266, 267, 
270, 271, 272, 273, 274, 275, 276, 278, 280, 
281, 282, 287, 297, 310, 312, 313, 314, 317 
definition, 256, 275 
hyperbolic, 271 

illustration, 269, 275, 277, 279, 283-286 
318, Plates 13, 16 
Julia set, filled, 255, 258, 259, 260, 263, 266, 
271, 272, 289 
definition, 255-256 

illustration, 259, 260, 261, 262, 315, 316 
just-touching, 116, 117, 125, 126, 142, 149, 
151, 152, 157, 161, 165, 183, 184, 185, 199, 



200, 211, 224, 238, 263, 274, 319, 320 
definition, 125 
illustration, 151 
just-touching collage, 103-106, 113-114 
illustration, 105 



landscape, 377 

illustration, Plates 21, 22 
Leaf and Sunflower, condensation sets, 

illustration, Plate 29 
leaf, 52, 53, 65, 97, 99, 103, 155, 377 

illustration, 52, 96, 103, 336, Plates 23, 24 
Lebesgue measure, definition, 344 
lily, 324 
Lim inf, 177 
limit, 16, 18 

definition, 16 
limit point, 19 

definition, 19 

illustration, 18 
Lim sup, 177, 181, 182 
linear fractional transformation, see Mobius 

transformation 
linear space, 5, 6 

linear transformation, 51, 54, 57, 78 

illustration, 51, 52, 78 
local iterated function systems, 380 



^11, definition, 315 
o\\ (A,p, e), definition, 198 
^» (A,p), definition, 198 
Mandelbrot, B., 1 

Mandelbrot set, 4, 301, 302, 303, 304, 305, 
306, 312, 313, 314, 317, 318, 324, 327 
definition, 301, 312 

illustration, 301, 303, 304, 305, 306, 307, 
308, 309, 310, 313, 315, 316, 317, 318, 319, 
329, Plate 16 
map, 3, 50, 83, 97, 128, 128, 141, 152, 154, 
165, 183, 186, 200, 203, 207, 209-211, 
212, 233, 240, 243, 263, 268, 271, 272, 274, 
278, 288, 289, 294-299, 301, 302, 308, 309, 
310, 317, 319, 324, 326, 361, 363, 376 
illustration, 56, 57, 59, 60, 65, 295, 296 
Plates 12, 14, 15 
mapping, 43, 58, 59, 62, 63, 64, 147, 317 
see also contraction mapping 
Markov chain, 392-393 
Markov operator, 350, 351, 352, 354, 360, 
406-409 
definition, 350 

illustration, 353, 354, 355, 356, 357, 358, 359 



measure, 4, 332, 335, 336, 341, 342, 343, 344, 
345, 347, 350, 351, 352 
definition, 341 
Mercator's projection, 24, 278 

illustration, 280 
metric, 11, 14, 29, 78, 99, 215, 218, 229, 232, 
272, 344, 349 
compact, 146 
definition, 10 

equivalent, 12, 215-216, 273 

definition, 12 
Euclidean, 11, 12, 14, 45, 58, 77, 173, 

175-176, 180, 195, 198-200, 210, 215-216, 

219, 246, 256 

illustration, 18 
illustration, 11, 13, 18 
Manhattan, 11, 12, 58, 77, 161, 180, 207 
metric equivalence, 24 
metric space, 2, 3, 10-41, 43, 68, 77, 79, 81, 

92, 111, 147, 148, 165, 167, 171, 172, 173, 

174, 177, 180, 181, 195, 198, 199, 213, 217, 

219, 238, 288, 297, 341, 342, 343, 344, 345, 

348, 349 
closedness of, 2 

see also closed; closedness 
compact, 4, 38, 75, 77, 78, 84, 122, 146, 272, 

299, 340, 344, 347, 348, 349, 350, 351, 

352, 359 

illustration, 76 
compactness of, 2, 77 

see also compact; compactness 
complete, 17, 20, 27, 31, 34, 75, 80, 102, 104, 

111, 122, 123, 172, 180, 216, 229, 232, 237 
completeness of, 2 

see also completeness 
connectedness of, 2 

see also connected; connectedness 
convergence of, 2 

see also convergence 
definition, 15 
equivalence of, 2, 15 

illustration, 21, 29 
equivalent, 12, 13, 14, 18, 19, 21, 22, 

126-129, 145-147 
illustration, Plate 1 
openness of, 2 

see also open; openness 
subsets of, 2, 15 
transformation on, 2 

see also transformation 
metrically equivalent, 145, 146, 165, 171, 

177, 200, 300 
definition, 145-146 
illustration, 16 
minimal period, definition, 132 
Mobius transformation, 2, 49, 50, 58-61, 72, 



74, 75, 83-84, 131, 287 
definition, 49, 58 
illustration, 59, 60, 61, 133, 134 
model, 29, 159, 173, 190, 287, 312, 347, 373, 

376 

Monterey Coast, 377 
illustration, Plate 26 
mountain, illustration, 106 
Myreberg point, 314 



jV(a, e), definition, 172 
natural scene, 374 
nested, 43^44, 279 

Newton transformation, 280, 281, 282, 317 

definition, 276 

illustration, 287, Plate 14 
Newton's method, 4, 276-287 
Non-Wavering Domain Theorem, 278 
norm, of a point, definition, 58 
norm, of the linear transformation, 

definition, 58 
numits, 249, 250, 252, 254, 320, 331, 370 

definition, 248 



one-parameter family, 130, 136, 149, 256, 272 

illustration, 230, 231 
one-to-one, 14, 43, 68, 140, 146, 157 

definition, 42-43 
onto, 14, 43, 68, 120, 124, 125 

definition, 42-43 
open set condition definition, 125 
open set, 62, 268, 289, 290 
open, 21, 23, 257, 288, 340 

definition, 20 
openness, 2, 15 

orbit, 2, 130, 131, 132, 134, 135, 136, 140, 153, 
156, 157, 158-163, 165, 167-169, 247, 255, 
257, 258, 268, 276, 277, 278, 287, 288, 310, 
313 

chaotic, 164-165, 169 
definition, 130 

illustration, 141, 142, 143, 144, 165, 166, 248 
outer measure, definition, 341 
overlapping 116-117, 119, 125, 126, 149, 151, 
152, 153, 155, 162, 183, 186 

definition, 116-118, 125 

illustration, 121, 152, 155 



+ (for set and real numbers), definition, 33 
parameter space, 3-4, 296, 297, 298, 299, 301, 
304, 306, 308, 310, 314, 317, 320, 326, 327 
illustration, 297, 301, Plates 17-20 



parameter space coloring, computer program 

for, 311-312, 314 
parameters, 23, 104, 106, 212, 233, 234, 271, 

272, 300, 301, 305, 306, 314 
continuous dependence of fractals on, 2, 

101-114 

continuous dependence of invariant measure 
on, illustration, Plate 25 
partition, definition, 347 
partition, diameter of, definition, 347 
path, 24, 26 

definition, 24 

illustration, 26 
perfect, 19, 44, 83 
perfection, 15 

definition, 19 
period, definition, 128 
periodic address, definition, 128 
periodic point, 129, 132, 274 

definition, 128, 132 

see also attractive periodic point; repulsive 
periodic point 
periodic, eventually, 133, 136, 141-142, 147, 
152 

definition, 129 
Pj & L, definition, 360 
Photocopy Machine Algorithm, 382 

illustration Plates 37, 40 
piecewise linear interpolation function, 210, 
212, 222 

illustration, 210 
point*, illustration, 6, 7, 18 
Point at Infinity, 255, 257, 258, 268, 272, 
310, 313 

polynomial transformation, 46, 49, 67, 258, 270 
definition, 46 
illustration, 47, 48, 50 



quadratic transformation, 61, 62, 64-65, 67, 74, 
253, 257, 266, 280 

illustration, 63, 64, 65 
quadree 

definition, 118 

illustration, 119 



R, definition, 54 
[R, definition, 5 
R 0 , definition, 54 

Random Iteration Algorithm, 84, 85, 87-88, 
89-90, 130, 165, 168, 169, 213, 267, 271, 
272, 330, 332, 334, 335, 347, 352, 365-366, 
376 

illustration, 113, 267, 268, 269, 332, 333, 334, 



335, 336, 337 
Random Iteration Algorithm, computer pro- 
gram, with interpolation and vertical 
scaling factors, 213-214 
Random Iteration Algorithm, computer pro- 
gram, 89-90, 210 
random shift dynamical system, 149, 153, 156, 
158, 159, 160, 161, 168, 167-168 
definition, 153 
illustration, 153, 163 
rational numbers, 167 
definition, 122 
rational transformation, 66, 271 
recurrent iterated function systems, 379 
Chaos Game for, 385-388 
Collage Theorem, 392-403 
Photocopy Machine Algorithm for, 388-392 
reflection, 59 
definition, 54 
repelling sets, 3 

repulsive cycle, 2, 141-142, 275 

definition, 132-133 
repulsive fixed point, 132, 141, 147, 247 

definition, 132-133 

illustration, 133-148 
repulsive periodic point, 132-133, 278 
Riemann sphere, 8, 43, 255, 268, 272, 278 

illustration, 8, 256, 270, 273, Plate 14 

Mbbius transformations on, 58, 61, 64-65 
rotation, definition, 54 



So, definition, 23 

scaling factor, 183, 184, 186, 199, 238, 263, 
302, 318 
definition, 54 
illustration, 56 

see also vertical scaling factor 
semigroup, 73, 74, 77 

definition, 73 
sensitive to initial conditions, definition, 167 
sequence, of intervals, 45 

illustration, 45, 46 
set dynamical systems, definition, 136 
set-valued inverse, definition, 65 
Seurat, 376-378 

Shadow Theorem, 2-3, 154, 156, 157, 158 
illustration, 156, 157 

Shadowing Theorem, 3, 159-160, 161, 162, 169 

illustration, 162, 163, 164 

proof, 160 
shear transformation, 212, 213-215 

definition, 212 

illustration, 212, 226 



shift dynamical system, 2, 140, 142, 146-147, 
148, 149-158, 159, 160, 161, 162, 163, 164, 
165, 167, 168, 169-170, 246, 252, 254, 273, 
274, 276, 319, 327 
definition, 140 

illustration, 141, 151, 152, 153, 247 
lifted, definition, 154 
see also deterministic shift dynamical 
system; random shift dynamical system 
shift operator, definition, 74 
shift transformation, definition, 140 
Sierpinski space A , definition, 9 
Sierpinski triangle, 9, 38, 49, 54, 62, 65, 77, 84, 
87, 91, 115-116, 118, 119-120, 122, 149, 
152, 161, 176, 184, 198, 200, 243, 246-249, 
250, 252, 259, 289, 332, 336, 341, 345-347 
IFS code for, 84-85, 88-90 
illustration, 18, 50, 63, 65, 88, 89, 117, 150, 

177, 199, 245, 247, 248, 335, 346 
table, 342 
2, definition, 8, 122 
a-field, definition, 340 
sigma-field, 340-341, 342, 343, 345 
sigma-field, generated by g, definition, 340 
similitude, 54, 58, 59, 60, 62, 65-66, 67, 68, 74, 
75, 97, 98, 99, 113, 183, 186, 199, 200, 238, 
263, 302, 318, 326 
definition, 54 

illustration, 55, 63, 96, 98, 103, 186 
simple function, 344, 347, 348, 350 

definition, 344 

illustration, 346 
slide and fold dynamical system, 136 

illustration, 138 
space AT, 5-10 

definition, 5 

points of, 5 
space, see metric space 
space-filling curves, 238-245 

illustration, 241, 242, 244, 245 

IFS code for, 238, 240, 243 
Square, IFS code for, 86 
strange attractor, see attractor, strange 
stretch, squeeze, and bend dynamical system 

illustration, 131, 132 
Sunflower Field, 377, 378 

illustration, Plate 30 
Sup S, definition, 33, 75 
support, of a measure, 344 

definition, 344 
supremum, 200 

definition, 33, 75 



topologically conjugate, 147 

definition, 146 
topologically equivalent, 146, 152 

definition, 145 
topology, 13, 15 

illustration, 13 
totally disconnected, 115 

definition, 125 

see also disconnected, totally 
transformation, 2, 29, 42-1 14 
affine, 2,42,43,45 
see also affine transformation 
contractive, 75 
definition, 42-43 
illustration, 21 
invertible, 43 

Mobius, see Mobius transformation 
rational, 42 
real line, 42-49 

see also affine; analytic; condensation; 
continuous; invertible; Mobius; Newton; 
one-to-one; onto; polynomial; quadratic; 
rational; and shear transformations 
transitive, 167 

definition, 167 
translation, 45, 51 

illustration, 52 
tree, 38, 93 
illustration, 92 



Tree, fractal, IFS code for, 86 

illustration, 93 
turbulent jet, see jet flame 
Twin-Dragon Fractal, 306 

illustration, 310 



uncountable, 120 
definition, 120 



Vector Recurrent Iterated Function System 
(VRIFS™), 408-409 
illustration, 409, Plates 37, 38, 39 
vector space, see linear space 
vertical scaling factor, 212-215, 223, 227 
vertical scaling factor, definition, 212 



web diagram, 136, 148, 163-165, 260 
computer program, 135-136 
definition, 135 

illustration, 134, 137, 148, 163, 166 



X, definition, 8 
x/, definition, 71 
\X\, definition, 
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Plate 4 

A truly four-dimensional fern. As we rotate 
this weird object in the three dimensions in 
which we live, what will we see emerging 
from the fourth dimension? Computed 
using 4-D Fernmaker™ . 






Plate 9 

The Escape Time Algorithm applied 
to an IFS of affine transformations. 
Such images can be magnified 
enormously, without loss of mystery. 



Plate 10 

Primitive art? 







Plato 1 1 

The Escape Time Algorithm is applied to an overlapping 
IFS of two affine transformations. Can such sets be used 
to model sections of plants, as can be seen under a 
microscope? 




Plate 12 

The Escape Time Algorithm is applied to an 
IFS of two maps. Here the magnification is ex- 
treme. Do not race by. Look closely and think. 



Plate 13 

The Escape Time Algorithm is applied to pro- 
duce an image of the Julia set for z 1 - 1 . 




Plate 14 

The Escape Time Algorithm is used to analyze the 
Newton transformation associated with the polynomial 
I 4 — f. Here the Riemann sphere is mapped onto a 
rectangle using Mercator's projection. The North Pole 
at the top corresponds to the Point at Infinity, while 
the South Pole, at the bottom, corresponds to the 
Origin. The roots are a, = 1 , a 2 = -1 , a 3 = i, 
a A - —i. Points whose orbits converge to a, 
are plotted in dark green; points whose orbits 
converge to a 2 are plotted in very light green; 
points whose orbits converge to a y are plotted 
in light green; and points whose orbits con- 
verge to.a 4 are plotted in white. 



Plate 15 

Zoom in on a piece of Plate 14 and change the 
color scheme. Every point on a boundary between 
two colors is actually on the boundary of all four 
colors! 



Plate 16 

Part of the Mandelbrot set 
for ^-A, painted in 
colors according to 
"escape times." The 
green land represents val- 
ues of the complex 
parameter X for which 
the associated Julia set is 
connected. The blue sea 
corresponds to totally dis- 
connected Julia sets. The 
exciting place is the 
coastline, where the white 
foam of the breaking 
waves symbolizes the 
transition between con- 
nection and disconnect 
tion. 





Plate 17 

Part of the parameter space P = {X eC: I XI < 1 } 

for the family of dynamical systems 

f - /(z-iyXifRez>0; 

M; \(z+l)/X*ifRez<0. 

ordered according to the "escape time" of a point 

OeR 1 . 




Plate 18 

An extreme close-up bf 
a piece of the parameter 
space in Color Plate 17. 



Plate 19 

A map of a small piece of the parameter space for 
the family of dynamical systems in Example 4.2 
in Chapter VIII. The map was computed using 
Algorithm 4. 1 in Chapter VIII. The grainy 
multicolored areas resemble the repelling sets 
of the dynamical system for the corresponding 
lues of Xv 




rte 23 

mpare the simple, green IFS 
'actor and the corresponding 
rel measure rendered using 
umnal colors. 





Plate 33 

Black Forest in Winter. 





Www'- 1 






mm 

&9b 



PI 



Plate 39 

A fractal tree. Computed using VRIFS™ and colored 
by PRESENTS™ . 



Plate 40 

A collection of TV cameras pointing 
at TV screens illustrates the idea of 
a Vector Recurrent Iterated Function 
System. There is a Markov process 
on each "screen." Some points are 
spawned and other trajectories die. 
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Vectors of 
Pictures. 



Plate 41 

The transformation W is a 
contraction mapping in the 
space of vectors of pictures. 



Plate 42 

An original photograph of a cafe in 
Valencia, Spain 

was scanned to create a 24-bit per 
pixel file that was 
1,345,960 bytes in size. 



Plate 43 

The decompressed image after 
compression down to 147,557 bytes; a 
compression ratio of 9: 1 . Compres- 
sion and decompression for Plates 43, 
44, and 45 were done using the fractal 
transform as implemented in Images 
Incorporated™ 2.0. 





Plate 44 

The decompressed image after 
compression down 
to 30,173 bytes; a compression 
atio of 44: 1 



Plate 45 

The decompressed image after 

compression down 

to 23,399 bytes; a compression ratio 

of58:l. 
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ABOUT THE SECOND EDITION: 



This volume is the second edition of the highly successful Fractals Everywhere. The focus of this text is 
how fractal geometry can be used to model real objects in the physical world. The new edition fea- 
tures: 

• A new chapter on recurrent iterated function systems, including vector recurrent iterated funcuon syst< 

• Problems and tools emphasizing fractal applications. 

• An all-new answer key to problems in the text, with solutions and hints. 

This edition of Fractals Everywhere is the most complete and up-to-date fractal textbook available today. 

"The material contained in the second edition is quite obviously more extensive in detail and scope... 
the style of writing is technically excellent, informative, and entertaining... . 

- Robert McCarty 

"The problems and examples are well-chosen and interesting... difficult concepts are introduced in a 
clear fashion with excellent diagrams and graphs." 

- Alan E. Wessel 
Santa Clara University 



Fractals Everywhere may be supplemented by Michael F. Barnsley s Desktop Fractal Design System (version 
2.0) with IBM or Macintosh software. The Desktop Fractal Design System 2.0 is a tool for designing 
Iterated Function Systems codes and fractal images, and makes an excellent supplement to a course on 
fractal geometry. 
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